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Method for producing a nucleotide sequence construct with optimised codons for an HIV 
genetic vaccine based on a primary, early HIV isolate and synthetic envelope BX08 
constructs. 



5 Field of the invention 

The invention relates to a DNA vaccine against HIV, which is designed from a clinical 
primary isolate. One aspect of the invention relates to a method of producing a nucleotide 
sequence construct, in a prefered aspect based on a cassette system, the nucleotide 
sequence construct being used as a DNA vaccine. The method can, for example, lead to the 
10 disclosed synthetic BX08 HIV-1 envelope vaccine nucleotide sequence construct, designed 
to generate suitable DNA vaccines against HIV, specifically HIV-1. Furthermore, the 
invention can be used for the production of recombinant protein antigens. 



Bacicground of the invention 

15 There is an urgent need for new vaccine strategies against HIV. One such new promising 
strategy is called genetic immunisation or DNA vaccine (Webster et al 1997). Some of the 
advantages of a DNA vaccine against HIV is the induction of Th cell activation, induction of 
antibodies also against conformational dependent epitopes, and the induction of cellular 
immunity. So far, most DNA vaccine envelope genes tried, have been from tissue culture 

20 adapted virus strains (Boyer et al 1997) that often differs in several aspects from primary 
clinical isolates (such as early isolates) e.g. in co-receptor usage (Choe et al 1996, Dragic et 
al 1997). 

One disadvantage in HIV envelope based DNA vaccines may be the intrinsic relatively low 
25 expression which is regulated by the Rev expression. This may prevent an optimal 
investigation of the vaccines in small animal models like mice where Rev is functioning 
suboptimally. Recently it has been shown using the tissue culture adapted HIV-1 MN strain, 
that an exchange of the HIV codon usage to that of highly expressed mammalian genes 
greatly improves the expression in mammalian cell lines and renders the HIV expression Rev 
30 independent (Haas et al 1996). Additionally, it is known that rare codons cause pausing of 
the ribosome, which leads to a failure in completing the nascent polypeptide chain and a 
uncoupling of transcription and translation. Pausing of the ribosome is thought to lead to 
exposure of the 3' end of the mRNA to cellular ribonucleases. 
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The world-wide spread of HIV-1 has presently resulted in 8,500 new infections daily and 
AIDS is now the number 1 cause of death among US males (and number 3 among US 
females) aged 25-40 years. The epidemic hot-spots now include Eastern Europe, India and 
5 South East Asia and southern Africa. The attempts to solve this world-wide problem involve 
education, prevention, treatment and vaccine development. Affordable protective vaccines 
represent the best solution to the world-wide problem of infection with HIV-1. Induction of 
virus neutralising antibodies is one of the key components in vaccine development. Several 
recombinant envelope vaccines have been tested in humans and animals, however, they 

10 seem unable to induce sufficient protection. In this respect DNA vaccination may provide a 
different and more natural mode of antigen presentation. It is hoped that the immune 
responses induced by such DNA vaccines could aid in limiting virus replication, slowing 
disease progression or preventing occurrence of disease. Unfortunately many HIV envelope 
vaccines induce only moderate levels of antibodies. This could in part be due to limitations in 

15 expression, influenced by regulation by the Rev protein and by a species-specific and biased 
HIV codon usage. Also the virus variability is considered a barrier for development of 
antibody based vaccines and thus a tool for more easy producing of closely related vaccine 
variants is needed. 

20 It has been suggested to improve the immunogenicity and antigenicity of epitopes by certain 
mutations in the envelope gene. An elimination of certain immune dominant epitopes (like 
V3) could render less immune dominant but more relevant, conserved, or hidden epitopes 
more immunogenic (Bryder et al 1999). Also elimination of certain N-linked glycosylation 
sites could improve the exposure of relevant epitopes and increase the immunogenicity of 

25 those epitopes. Thus, it is possible that elimination of the glycosylation sites in VI and V2 
may in a more favourable way expose neutralising epitopes (Kwong et al 1998, Wyatt et al 
1998). The HIV envelope contains putative internalisation sequences in the intracellular part 
of gp41 (Sauter et al 1996). Thus it would be relevant to eliminate and/or mutate the 
internalisation signals in a membrane bound HIV envelope vaccine gene to increase the 

30 amount of surface exposed vaccine derived HIV glycoproteins as gp150. Since the antibody 
response, that is measured and calculated in titers, is improved by adding the secreted 
gp120 as opposed to adding the membrane bound form (Vinner et al 1999), it could be 
advantageous to express the vaccine as a secreted gp120 or a secreted gp140. This would 
include important parts of gp41, such as the 2F5 neutralising linear epitope (Mascola et al 

35 1997). 
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Summary of the invention 

Our suggested solution to the problems described above is to design DNA envelope 
vaccines from a clinical primary isolate with Rev-independent high expression in mammals, 
that is built as a cassette for easy variant vaccine production. 

5 

A method of producing a nucleotide sequence construct with codons from highly expressed 
mammalian proteins based on a cassette system coding for an early, primary HIV envelope 
is described. The method comprises the steps of direct cloning of an HIV gene, derived from 
a patient within the first 12 months of infection, thereby obtaining a first nucleotide sequence; 

1 0 designing a second nucleotide sequence utilising the most frequent codons from mammalian 
highly expressed proteins to encode the same amino acid sequence as the first nucleotide 
sequence; redesigning the second nucleotide sequence so that restriction enzyme sites 
surround the regions of the nucleotide sequence encoding functional regions of the amino 
acid sequence and so that selected restriction enzyme sites are removed, thereby obtaining 

15 a third nucleotide sequence encoding the same amino acid sequence as the first and the 
second nucleotide sequence; redesigning the third nucleotide sequence so that the terminals 
contain convenient restriction enzyme sites for cloning into an expression vehicle; producing 
snuts between restriction enzyme sites as well as terminal snuts and introducing snuts into 
an expression vehicle by ligation. The nucleotide sequence construct obtained by this 

20 method uses the mammalian highly expressed codons (figure 1) and renders the envelope 
gene expression Rev independent and allows easy cassette exchange of regions 
surrounded by restriction enzyme sites that are important for immunogenicity, function, and 
expression. 

25 The method can, for example, lead to the disclosed synthetic, Rev-independent, clinical 
(such as early), primary HIV-1 envelope vaccine gene, built as a multi cassette. From the 
sequence of the envelope of the HIV-1 BX08 isolate (personal communication from Marc 
Girard, Institute Pasteur, Paris), the present inventors have designed a synthetic BX08 HIV-1 
envelope vaccine nucleotide sequence construct. 

30 

With the great diversity of envelopes in HIV among different patients and within one patient, 
it would be of advantage to vaccinate with several envelope variants, all being highly 
expressed. To avoid synthesising several full length envelopes, it is much easier to 
exchange relevant parts of an envelope cassette to various strains in a multivalent vaccine. 

35 
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Whether it is the disclosed synthetic BX08 nucleotide sequence construct, or any of the 
nucleotide sequence constructs obtained by the method, they are designed to generate 
suitable DNA vaccines against HIV, specifically HIV-1. In this case the mammal, preferably a 
human being, is inoculated with the nucleotide sequence construct in an expression vehicle 
5 and constitutes a host for the transcription and translation of the nucleotide sequence 
construct. The nucleotide sequence constructs of the present invention can furthermore be 
used for the production of recombinant protein antigens. In this case the nucleotide 
sequence construct is placed in an expression vehicle and introduced into a system (e.g. a 
cell-line), allowing production of a recombinant protein with the same amino acid sequence. 
10 The recombinant protein is then isolated and administered to the mammal, preferably a 
human being. The immune system of the mammal will then direct antibodies against 
epitopes on the recombinant protein. The mammal, preferably a human being, can thus be 
primed or boosted with DNA and/or recombinant protein obtained by the method of the 
invention. 

15 

A relevant HIV DNA vaccine can potentially be used not only as a prophylactic vaccine, but 
also as a therapeutic vaccine in HIV infected patients, e.g. during antiviral therapy. An HIV 
specific DNA vaccine will have the possibility to induce or re-induce the wanted specific 
immunity and help the antiviral therapy in limiting or even eliminating the HIV infection. The 

20 immunogenicity and antigenicity of epitopes in the envelope can be improved by certain 
mutations in the envelope gene. The cassette system allows for easy access to the relevant 
parts of the envelope gene, and thereby eased efforts in the process of genetic manipulation. 
Some suggested mutations are: an elimination of certain immune dominant epitopes (like 
V3); elimination of certain N-linked glycosylation sites (like glycosylation sites around V2); 

25 elimination and/or mutation of the nucleotide sequence encoding the internalisation signals in 
the cytoplasmic part of a membrane bound HIV envelope to increase the amount of surface 
exposed vaccine derived HIV glycoproteins; elimination or mutation of the cleavage site 
between gp120 and gp41; with introduced mutations in gp41 for preserving conformational 
epitopes. 

30 

Table 1 below, lists the nucleotide sequence constructs of the invention by the names used 
herein, as well as by reference to relevant SEQ ID NOs of DNA sequences, and the amino 
acid sequence encoded by the DNA sequence in the preferred reading frame. It should be 
noted, that the snut name consist of the number of the approximate position for the end of 
35 the snut and the restriction enzyme used to cleave and/or ligate that end of the snut. 
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Table 1 List of names of nucleotide sequence constructs (Snuts (S) and Pieces (P)) 
with reference to SEQ ID NO for the nucleotide sequence and protein sequence. 



Name 


Nucleotide SEQ ID 


Protein SEQ ID 




NO: 


NO: 




1 


2 


^ZootCOKV 


3 


4 


^375PStl 


5 


6 




7 


8 


W650-720ECORI 


9 


10 


^^nnn A i 

^yOOADai 


11 


12 


SggoSacI 


13 


14 


Q 

^mospei 


15 


16 


^1265Xnol 


17 


18 


Q 

'^1265gp120 


19 


20 


Q 

*^1265gp160 


21 


22 


o 

Oi465Pstl 


0^ 


24 


Si465Pstl cys 


oc 


26 


Si630Xbal 


^ 1 


28 


Q 

<^1700Eagl 


29 


30 


*^1890Hinaiil 


31 


32 


Q 

O2060Saclt 


33 


34 


^ziyuLriai 


35 


36 


^2330PStl 


37 


38 


^2425t5 


39 


40 


P. 
"1 


41 


42 


'2 


43 


44 


* 3 


45 


46 




47 


48 


•3 Gv1V2 


49 


50 


P3GV2 


51 


52 


P4gp160 


53 


54 


P4gp150 


55 


56 


■ 4gpi4u 


57 


58 


Ps 


59 


60 


Psgpieo 


61 


62 


Pegpiso 


63 


64 


P8gp140 


65 


66 


synBX08-140 


67 


68 


synBX08-150 


69 


70 


synBX08-160 


71 


72 


synBX08-120 


73 


74 


synBX08-41 


75 


76 
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Detailed disclosure of the invention 

One aspect of the present invention relates to a method for producing a nucleotide sequence 
construct coding for an HIV gene. The nucleotide sequence construct is produced as a 
cassette system consisting of snuts. A snut (S) is a nucleotide sequences construct between 
5 restriction enzyme cleavage sites comprising the minimal entity of the cassette system. 

First an HIV gene is obtained from a patient within the first 12 months of infection. The term 
HIV should be understood in the broadest sense and include HIV 1 and HIV 2. It is possible 
to determine the period in which the infection has taken place with an accuracy depending 

10 on the frequency of the blood tests taken from the patient. For example, patients suffering 
from various diseases such as lack of certain factors in their blood or hepatitis have their 
blood tested on a regular basis making it possible to determine the period in which the 
infection has taken place. Apart from patients with diseases wherein blood tests are used to 
monitor the course of the disease, other groups of patients have blood tests taken, e.g. blood 

15 donors. Unfortunately, humans are still infected due to transfer of virus in blood samples, 
medical equipment, etc., making it possible to determine the date where the infection has 
taken place within the time frame of a few days. The importance of obtaining the virus early 
in the course of the infection is due to the known fact that many early isolates share the 
common feature of staying relatively constant in their envelope sequences (Karlsson et al., 

20 1998). As these early isolates may share cross-reactive antibody- and/or T-cell epitopes a 
vaccine based on such early isolates would have a better chance of inducing immune 
response to shared epitopes of the virus. It is believed that an early, directly cloned virus 
isolate will share immunogenic sites with other early virus isolates seen during an HIV 
infection, so that if a mammal generates antibodies and/or T-cells directed against these 

25 epitopes, the transferred virus will be eliminated prior to the extensive mutations that may 
occur after approximately 12 months of infection. Thus, the virus should be isolated as early 
as possible, that is within the first 12 months of infection, such as 1 1, 10, 9, 8, 7, 6, 5, 4, 3, 2, 
1, or 0.5 month after infection. 

30 The HIV gene for genetic vaccine is preferably cloned directly from viral RNA or from proviral 
DNA. Direct cloning in this application stands for the virus not being multiplied in stable cell 
lines in vitro. It is presently expected that passing the virus through a stable cell line will 
promote mutation in the virus gene. It is particularly preferred not to pass the virus through 
cells lines selecting for viruses with CXCR4 receptor usage. Direct cloning also includes 

35 multiplication of virus in e.g. PBMC (peripheral blood mononuclear cells) since all virus can 
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multiply in PBMC, and this type of multiplication generally does not select for CXCR4 
receptor usage. Multiplication of virus is often necessary prior to cloning. Preferably cloning 
is performed directly on samples from the patient. In one embodiment of the invention, 
cloning is performed from patient serum. The cloning is then performed directly on the HIV 
5 virus, that is on RNA. In another embodiment of the invention cloning, is performed from 
infected cells. The cloning is then performed on HIV virus incorporated in the genes in an 
infected cell (e.g. a lymphocyte), that is on DNA. In the latter case the virus might be a silent 
virus, that is a non-repiicating virus. To evaluate if the virus is silent, capability of 
multiplication in e.g. PBMC is tested. 

10 

Cloning is a technique well known to a person skilled in the art. A first nucleotide sequence is 
hereby obtained. In another aspect of the invention, the first nucleotide sequence, sharing 
the properties mentioned with direct cloning, is obtained by other means. This could be from 
a database of primary isolates or the like. 

15 

Based on the first nucleotide sequence, the amino acid sequence encoded by said 
nucleotide sequence is determined. A second nucleotide sequence encoding the same 
amino acid sequence is then designed utilising the most frequent codons from highly 
expressed proteins in mammalians (e.g. figure 1 presenting the most frequent codons from 
20 highly expressed proteins in humans). 

Presently, it appears that the usage of the most frequent codons from mammalian highly 
expressed proteins has two advantages: 1) the expression is Rev independent; 2) the level 
of expression is high. The Rev independence is especially advantageous when performing 

25 experiments in mice where the Rev systems Is functioning sub-optimally. For the use in 
human vaccine, Rev independence and high expression are important to increase the 
amount of antigen produced. The determination of the codons for high expression is in this 
context based on the statistics from human highly expressed proteins (Haas, Park and Seed, 
1996 hereby incorporated by reference). It is contemplated that the expression of a protein 

30 can be even higher, when current research in binding between codon (on the mRNA) and 
anticodon (on the tRNA) reveals codons with optimal binding capabilities, and when 
interactions in-between codons and/or in-between anticodons are known. 

The second nucleotide sequence designed utilising optimised codons is then redesigned to 
35 obtain a third nucleotide sequence. The purpose of the redesigning is to create unique 
restriction enzyme sites around the nucleotide sequence encoding functional regions of the 
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amino acid sequence. By having unique restriction enzyme sites around the nucleotide 
sequence encoding functional regions of the amino acid sequence, the nucleotide sequence 
encoding functional regions of the amino acid sequence can easily be isolated, changed, and 
re-inserted. Examples of functional regions of the amino acid sequence are transmembrane 
5 spanning regions, immunodominant regions, regions with antibody cross reacting domains, 
fusion domains and other regions important for immunogenicity and expression such as 
variable region 1 (V1), variable region 2 (V2), variable region 3 (V3), variable region 4 (V4) 
and variable region 5 (V5). 

10 It is important to select the restriction enzymes sites with care. By changing the second 
nucleotide sequence to insert restriction enzyme sites around the nucleotide sequence 
encoding functional regions of the amino acid sequence, the third nucleotide sequence must 
still code for the same amino acid sequence as the second and first nucleotide sequence do. 
Thus, if necessary, the second nucleotide sequence is redesigned by changing from 

15 optimised codons to less optimal codons. It is understood, that the restriction enzyme sites 
around the nucleotide sequence encoding functional regions of the amino acid sequence 
should preferably be placed in the terminal region of the nucleotide sequence encoding 
functional regions of the amino acid sequence. That Is preferably outside the nucleotide 
sequence encoding functional regions of the amino acid sequence, such as 90 nucleotides 

20 away, e.g. 81, 72, 63, 54, 45, 36, 27, 21, 18, 15, 12, 9, 6,3 nucleotides away, but could also 
be inside the nucleotide sequence encoding functional regions of the amino acid sequence, 
such as 54, 45, 36, 27, 21, 18, 15, 12, 9, 6, 3 nucleotides Inside the nucleotide sequence 
encoding the functional region of the amino acid sequence. 

25 The type of restriction enzyme sites allowed Is determined by the choice of expression 

vector. In certain cases, the number of restriction enzyme sites Is limited and it Is hard. If not 
impossible, to place unique restriction enzyme sites around ail the nucleotide sequences 
coding for functional regions of the amino acid sequence. This problem can be solved by 
dividing the entire nucleotide sequence Into pieces, so that each piece comprises only 

30 unique restriction enzyme sites. Modifications to each of the piece is performed separately 
prior to assembly of the pieces. It Is preferred that the nucleotide sequence Is divided into 9 
pieces. In another aspect, the nuclotlde sequence Is divided Into 8 pieces, or 7, or 6, or 5. or 
4, or 3, or 2 pieces. It is especially preferred that the nucleotide sequence Is divided Into 3 
pieces. 
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Thus, the redesign of the second nucleotide sequence is an interaction between the choice 
of cloning vector, expression vector, selection of restriction enzyme sites, dividing into 
pieces, and exchange of codons to insert restriction enzyme sites. In a preferred 
embodiment of the present invention the cloning vector is Bluescript allowing the restriction 
5 enzyme sites chosen from the group consisting of: EagI, Mul, EcoRV, Pstl, C/al, EcoRI, 
Xba\, Sad, Spel, Xho\, HindWl Sacll, A/ofI, BamHI, Smal, Sa/I, Dral, Kpnl. If other cloning 
vectors are chosen, other restriction enzyme sites will be available as known by the person 
skilled in the art. 

10 As a part of the redesigning of the second nucleotide sequence, selected restriction enzyme 
sites may be removed. The selected restriction enzyme sites to be removed are those sites 
that are sites of the same type as the ones already chosen above and that are placed within 
the same piece. The removal of these restriction enzyme sites is performed by changing 
from optimised codons to less optimal codons, maintaining codons for the same amino acid 

15 sequence. 

The third nucleotide sequence is redesigned so that the terminal snuts contain convenient 
restriction enzyme sites for cloning into an expression vehicle. The expression "vehicle" 
means any nucleotide molecule e.g. a DNA molecule, derived e.g. from a plasmid, 

20 bacteriophage, or mammalian or insect virus, into which fragments of nucleic acid may be 
Inserted or cloned. An expression vehicle will contain one or more unique restriction enzyme 
sites and may be capable of autonomous replication in a defined host or vehicle organism 
such that the cloned sequence is produced. The expression vehicle is an autonomous 
element capable of directing the synthesis of a protein. Examples of expression vehicles are 

25 mammalian plasmids and viruses, tag containing vectors and viral vectors such as 

adenovirus, vaccinia ankara, adenoassociated virus, cannarypox virus, simliki forest virus 
(sfv), Modified Vaccinia Virus Ankara (MVA), and simbis virus. In one embodiment of the 
invention, the expression vector contains tag sequences. In another embodiment of the 
invention a bacteria is transformed with an expression plasmid vector and the bacteria is 

30 then delivered to the patient. Preferred expression vehicles are simliki forest virus (sfv), 
adenovirus and Modified Vaccinia Virus Ankara (MVA). 

The snuts are produced by techniques well known by the person skilled in the art. The 
preferred method for synthesising snuts, is herein referred to as "the minigene approach" 
35 wherein complementary nucleotide strands are synthesised with specific overhanging 
sequences for annealing and subsequent ligation into a vector. This can be performed with 
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two sets of complementary nucleotide strands, or with three sets of complementary 
nucleotide strands. The minigene approach minimises the known PGR errors of mismatches 
and/or deletions, which may occur due to hairpins in a GC rich gene with mammalian highly 
expressed codons. In figures 10-21, the production of a representative selection of snuts is 
5 illustrated. 

For the production of long snuts, that is snuts of more than about 240 nucleotides, the 
technique of overlapping PGR is preferred as illustrated in figure 8. Herein two nucleotide 
strands about 130 nucleotides long with an overlap are filled to obtain a double strand, which 
10 is subsequently amplified by PGR. 

For the production of multiple snuts with a length of less than about 210 nucleotides, one 
preferred technique is normal PGR. In a preferred production technique the snuts are 
synthesised with the same 5* flanking sequences and with the same 3' flanking sequences, 
15 as illustrated in figure 9. The advantages of this approach is, that the same PGR primer set 
can be used for amplification of several different snuts. 

As known by the person skilled in the art, special conditions have to be used for each 
individual PGR reaction and it should be optimised to avoid inherent problems like deletions 
20 mismatches when amplificating GG rich genes from synthetic ssDNA material. Whichever of 
the above mentioned techniques are used, it is well known by the person skilled in the art, 
that it will be necessary to correct unavoidable mismatches produced either due to the 
nucleotide strand synthesis material and/or the PGR reaction. This can be performed by site 
directed mutagenesis techniques. 

25 

After the various snuts have been produced, they are assembled into pieces and 
subsequently into the complete gene. Methods for assembly (such as ligation) are well 
known by the person skilled in the art. 

30 

In a preferred embodiment of the present invention the HIV gene encodes the entire HIV 
envelope. It is understood that the HIV envelope can be the full length envelope gp160 as 
well as shorter versions such as gp150. gp140, and gp120 with or without parts of gp41. 



35 



As will be known by the person skilled in the art, the HIV is divided into several groups. 
These groups presently include group M, group O, and group N. Further, the HIV is divided 
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into subtypes A, B, C, D, E, F, G, H, I, and J. In the present invention subtype B is preferred 
due to the high prevalence of this subtype in the Western countries. 



The determination of groups and subtypes is based on the degree of nucleotide sequence 
5 identity in the envelope gene is presently defined as follows: If the sequence identity is more 
than 90% the viruses belong to the same subtype; If the sequence identity is between 80% 
and 90% the viruses belong to the same group. If the sequence identity is less than 80% the 
viruses are considered as belonging to different groups. 



10 One aspect of the invention relates to a nucleotide sequence construct in isolated form which 
has a nucleotide sequence with the general formula (I), (II), (III), or (IV) 

(I) Pl-S495ClarS650-720EcoRrP2-Si265gp120 

(II) Pl-S495ClarS650-720EcoRrP2'"Si265Xhor Si465Pstr P4gp140 

(III) Pl-S495ClarS650-720EcoRrP2"Si265Xhor Si465Pstr P4gp150 

15 (IV) Pl-S495ClarS650-720EcoRrP2"*Si265Xhor Si465Pstr P4gp160- S2060Sacir Ps 

wherein Pi designates the nucleotide sequence SEQ ID NO:41, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 90% 
thereto; 

wherein 84950131 designates the nucleotide sequence SEQ ID NO: 7, a nucleotide sequence 
20 complementary thereto, or a nucleotide sequence with a sequence identity of at least 95% 
thereto; 

wherein S65o-72oecori designates the nucleotide sequence SEQ ID NO: 9, a nucleotide 
sequence complementary thereto, or a nucleotide sequence with a sequence identity of at 
least 95% thereto; 

25 wherein P2 designates the nucleotide sequence SEQ ID NO: 43, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

wherein Si265gpi2o designates the nucleotide sequence SEQ ID NO: 19, a nucleotide 
sequence complementary thereto, or a nucleotide sequence with a sequence identity of at 
30 least 70% thereto; 

wherein Si265Xhoi designates the nucleotide sequence SEQ ID NO: 17, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 80% 
thereto; 

wherein Si465Psti designates the nucleotide sequence SEQ ID NO: 23, a nucleotide sequence 
35 complementary thereto, or a nucleotide sequence with a sequence identity of at least 90% 
thereto; 
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wherein P4gpi4o designates the nucleotide sequence SEQ ID NO: 57, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

wherein P4gpi5o designates the nucleotide sequence SEQ ID NO: 55, a nucleotide sequence 
5 complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

wherein P4gpi6o designates the nucleotide sequence SEQ ID NO: 53, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

10 wherein S206osacii designates the nucleotide sequence SEQ ID NO: 33, a nucleotide 

sequence complementary thereto, or a nucleotide sequence with a sequence identity of at 
least 98% thereto; and 

wherein P5 designates the nucleotide sequence SEQ ID NO: 59, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
15 thereto. 

The design of the parent synthetic BX08 gplSO envelope cassette gene with its variant 
length genes gp150, gp140, gp120 is outlined in figure 2. 

20 The nucleotide sequence construct with the formula (I) 

( I ) Pi -S495Clar S650-720EcoRr Pa'S 1 265gp1 20 

(visualised in figure 3) (SEQ ID NO: 73) codes for the amino acid sequence of gp120 (SEQ 
ID NO: 74). This amino acid sequence is the part of the HIV envelope that is secreted. Thus, 
it contains the immunogenic epitopes without being bound to the cell membrane. This is of 
25 particular advantage if the nucleotide sequence construct is used for production of 

recombinant antigens or for a DNA vaccine as the antibody immune response may be higher 
to secreted versus membrane bound HIV antigens. 

The nucleotide sequence construct with the formula (II) 
30 (II) 

Pl-S495ClarS650-720EcoRrP2-Si265Xhor Si465Pstr P4gp140 

(visualised in figure 4) (SEQ ID NO: 67) codes for the amino acid sequence of gp140 (SEQ 
ID NO: 68). This amino acid sequence encodes the gp120 and the extracellular part of the 
gp41 protein. The amino acid sequence is secreted due to the lack of the transmembrane 
spanning region. This is of particular advantage if the nucleotide sequence construct is used 
35 for production of recombinant antigens as the immunogenic and/or antigenic epitopes in the 
extracellular part of gp41 are included and is of particular advantage for a DNA vaccine as 
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the antibody immune response may be higher to secreted gp120 versus membrane bound 
HIV antigens. 

The nucleotide sequence construct with the formula (III) 

5 (III) Pl-S495ClarS650-720EcoRrP2-Si265Xhor Si466Pstr P4gp150 

(visualised in figure 5) (SEQ ID NO: 69) codes for the amino acid sequence of gp150 (SEQ 
ID NO: 70). This amino acid sequence contains all of the envelope protein gp160 except the 
c-terminal tyrosin containing internalisation signals in the intracellular part of gp41. The 
membrane bound surface expression of the amino acid sequence is thereby maintained and 
10 enhanced. Mimicking the organisation of the native epitope conformation may by expected, 
making this nucleotide sequence construct of particular advantage if the nucleotide 
sequence construct is used as a vaccine. 

The nucleotide sequence construct with the formula (IV) 

15 (IV) Pl-S495ClarS660-720EcoRrP2-Si265Xhor Si465Pstr P4gp160- S2060Sacir Ps 

(visualised in figure 6) (SEQ ID NO: 71) codes for the amino acid sequence of gp160 (SEQ 
ID NO: 72) i.e. the entire envelope. 

The nucleotide sequence construct designated Pi comprises the nucleotide sequence 
20 encoding the amino acid sequence in the first variable region (VI) and the amino acid 
sequence in the second variable region (V2). In one embodiment of the invention the first 
variable region is surrounded by EcoRV and PstI restriction enzyme sites, and the second 
variable region is surrounded by PstI and Clal restriction enzyme sites but as stated above, 
the choice of restriction enzyme sites can alter. 

25 

The nucleotide sequence construct designated S65o-72oecori comprises the nucleotide 
sequence encoding the amino acid sequence in the third variable region (V3). In one 
embodiment of the present invention S65o-72oecori is characterised by the restriction enzyme 
sites EcoRI and Xbal in the terminals. 

30 

The nucleotide sequence construct designated P2 comprises the nucleotide sequence 
encoding the amino acid sequence of the fourth variable and constant region (V4 and C4). In 
one embodiment of the present invention the forth variable region is surrounded by Sad and 
Xho\ restriction enzyme sites. 

35 
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The nucleotide sequence construct designated Si265gpi2o comprises the nucleotide sequence 
encoding amino acid sequence of the fifth variable and constant region {V5 and C5). 
Si265gpi2o furlher comprises a nucleotide sequence encoding a C-terminal stop codon. 

5 The nucleotide sequence construct designated P4gpi4o comprises the nucleotide sequence 
encoding amino acid sequence of the transmembrane spanning region. P4gpi4o further 
comprises a nucleotide sequence encoding a C-terminal stop codon prior to the 
transmembrane spanning region. 

10 The nucleotide sequence construct designated P4gpi6o comprises the nucleotide sequence 
encoding amino acid sequence of the transmembrane spanning region (trans membrane 
spanning domain: TMD). In a preferred embodiment of the present invention the 
transmembrane spanning region is surrounded by Hindlll and Sacll restriction enzyme sites. 

15 The term "sequence identity" indicates the degree of identity between two amino acid 

sequences or between two nucleotide sequences calculated by the Wilbur-Lipman alignment 
method (Wilbur et al, 1983). 

The nucleotide sequence constructs with the formula (I), (II), (III), or (IV) illustrates the 
20 flexibility in the present invention. By producing a gene with the described method enables 
the production of a plethora of antigens with various immunogenic epitopes and various 
advantages for production and vaccine purposes. To further illustrate the flexibility of the 
invention, other changes and mutations are suggested below. 

25 In order to improve the immunogenicity of the nucleotide sequence constructs of the 
invention it is suggested to change the nucleotide sequence such that one or more 
glycosylation sites are removed in the amino acid sequence. By removal of shielding 
glycosylations, epitopes are revealed to the immunesystem of the mammal rendering the 
construct more immunogenic. The increased immunogenicity can be determined by an 

30 improved virus neutralisation. Changes in the nucleotide sequence such that one or more N- 
linked glycosylation sites are removed in the amino acid sequence is well known by the 
person skilled in the art. Potential glycosylation sites are N in the amino acid sequences N-X- 
T or N-X-S (wherein X is any amino acid besides P). The glycosylation site can be removed 
by changing N to any amino acid, changing X to a P, or changing T to any amino acid. It is 

35 preferred that N is changed to Q by an A to C mutation at the first nucleotide in the codon, 
and a C to G mutation at the third nucleotide in the codon. This is preferred to increase the 
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GC content in the nucleotide sequence construct. As an alternative N is changed to Q by an 
A to C mutation at the first nucleotide in the codon, and a C to A mutation at the third 
nucleotide in the codon. Preferred mutations in the synthetic BX08 envelope gene to remove 
potential N-linked glycosylation sites in V1 and/or V2 are A307C + C309A and/or A325C + 
5 C327G and/or A340C + C342A and/or A385C + C387A and/or A469C + C471 A. Examples 
of such changes is illustrated in SEQ ID NOs: 47, 49, and 51. 

For historical reasons the HIVs have been divided into syncytia inducing strains and non 
syncytia inducing strains. The assay to determine whether a strain Is syncytia inducing is 

10 described in Verrier et al 1997, hereby incorporated by reference. It is presently known, that 
viruses utilising the CXCR4 co-receptor are syncytia inducing strains. It is also, at the 
present, known that the binding site for the CXCR4 involves the third variable region (V3). In 
a preferred embodiment the nucleotide sequence construct is changed to create a binding 
site for the CXCR4 co-receptor. It is presently performed in the third variable regions, 

15 preferably by the mutation G865C + A866G. 

It is well established that the HIV envelope comprises immunodominant epitopes. An 
immunodominant epitope is an epitope that most antibodies from the mammal are directed 
against. The antibodies directed against these immunodominant epitopes may have little 

20 effect in elimination of the virus. It is therefore anticipated that modification of the 

immunodominant epitopes will induce antibodies directed against other parts of the envelope 
leading to a better elimination and neutralisation of the virus. By modification is understood 
any change in the nucleotide sequence encoding an immunodominant epitope in the amino 
acid sequence such that said amino acid sequence no longer contains an immunodominant 

25 epitope. Thus, modification includes removal of the immunodominant epitope and decrease 
of immunogenicity performed by mutagenesis. In a preferred embodiment of the present 
invention an immunodominant epitope in the third variable region (V3) is modified, such as 
deleted or altered. In a much preferred embodiment the nucleotides 793-897 are deleted. 
In yet another preferred embodiment of the present invention an immunodominant epitope 

30 has been removed from gp41, such as deleted. This is performed in P7 or Re by elimination 
of the nucleotides 1654-1710. 

It is anticipated that when gp120 is dissociated from gp41 in a vaccine or antigen, two 
immunodominant epitopes, one on each protein, are exposed and antibodies are directed 
35 against these in the mammal. In the infectious virus, gp120 is coiled on top of gp41 and the 
gp120/gp41 is most likely organised in a trimer, so that these immunodominant epitopes are 
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hidden and therefore less elimination of virus is observed. By removing the cleavage site 
between gp41 and gp120 a full length gp160, gp150, or gp140 can be obtained with a 
covalent binding between gp41 and gp120. Removal of the cleavage site between gp41 and 
gp120 is preferably performed by a mutation at position C1423A. An example of such a 
5 mutation is illustrated in the mutation of Si265Xhoi (SEQ ID NO: 17) to Si265gpi6o (SEQ ID NO: 
21). 

In order to stabilise the full length gp160, gp150, and gp140 for example when the cleavage 
site between gp41 and gp120 has been removed as described above, cysteins can be 
10 inserted, preferably inside the gp41 helix creating disulphide bounds to stabilise a trimer of 
gp41s. In a preferred embodiment of the present invention the cysteins are inserted by the 
mutation 1618:CTCCAGGC:1625 to 1618:TGCTGCGG:1625. An example of such a change 
is illustrated in SEQ ID NO: 25. 

15 The above mentioned decrease in immunodominant epitopes combined with the increase in 
immunogenicity of the other epitopes is expected to greatly enhance the efficacy of the 
nucleotide sequence construct as a vaccine. 

During the production of the nucleotide sequence construct, it is convenient to ligate the 
20 snuts into pieces. The pieces, as described above, are characterised by their reversible 
assembly as there are no duplicate restriction enzyme sites. In a preferred embodiment one 
piece (herein designated P3) contains Pi, 84950131, S65o-72oecori, and P2. Another piece (herein 
designated Ps) contains S 1265Xho(j Si465Pstlj and P4gp160- Yet another piece (herein designated 

P7) contains Si265Xhot7 Si465Pst!i P4gp160, S2060Sad!, ^^id P5. 

25 

One advantage of the present nucleotide sequence construct is the easy access to 
exchange and alterations in the content and function of the nucleotide sequence and the 
encoded amino acid sequence. In one embodiment the nucleotide sequence coding for a 
functional region or parts thereof of the amino acid sequence is repeated. The repeat could 
30 be back-to-back or a functional region or parts thereof could be repeated somewhere else in 
the sequence. Repeated could mean two (one repetition) but could also be three, six, or nine 
repeats. In a much preferred embodiment the repetition nucleotide sequence codes for 
amino acids in the third variable region. 



35 



In order to improve the protective capabilities of the invention against infections with HIV, 
one embodiment of the invention relates to the combination of epitopes. The present 
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nucleotide sequence construct allows insertion of one or more new nucleotide sequences 
isolated from another group and/or subtype of HIV and/or isolated from another patient. 
Hereby a vaccine or antigen with two or more epitopes from two or more HIVs is obtained. In 
a preferred embodiment, the V3 is replaced by the new nucleotide sequence. In a much 
5 preferred embodiment, the new nucleotide sequence codes for amino acids in the third 
variable region of a different HIV isolate. 

In order to improve the efficacy of the vaccine, aiming at raising cellular immunity, a 
nucleotide sequence coding for a T-helper cell epitope is included in the nucleotide 

10 sequence construct. The nucleotide sequence coding for a T-helper cell epitope or a T- 
helper cell epitope containing amino acid sequence can be put in anywhere in the nucleotide 
sequence construct as long as it does not interact with the function of the envelope molecule. 
However, it Is preferably placed in the tail of the nucleotide sequence construct or between 
the leader sequence and the envelope gene. The T-helper epitopes are preferably selected 

15 from core proteins such as P24gag or from a non-HIV pathogen such as virus, bacteria, e.g. 
BCG antigen 85. For a therapeutic vaccine an HIV helper epitope is preferred since the 
patient is already primed by the HIV infection. For a prophylactic vaccine, a T-helper cell 
epitope from a frequently occurring non HIV pathogen such as Hepatitis B, BCG, CMV, EBV 
is preferred. Also, since the synthetic BX08 envelope genes may contain T-helper cell 

20 epitopes in addition to important antibody epitopes, the synthetic BX08 vaccine genes can be 
mixed with other DNA vaccines to improve the efficacy of the other DNA vaccine. 

One aspect of the present invention relates to individualised immunotherapy, wherein the 
virus from a newly diagnosed patient is directly cloned, the envelope or subunits 

25 corresponding to snuts or pieces is produced with highly expressed codons, inserted into any 
of the nucleotide sequence constructs described above and administered to the patient as a 
vaccine. Hereby a therapeutic DNA vaccine is obtained, that will help the patient to break 
immunetolerance or induce/reinduce an appropriate immune response. In one embodiment 
the variable regions of the virus are produced with highly expressed codons and exchanged 

30 into any of the nucleotide sequence constructs described above. 

In one embodiment of the invention, the nucleotide sequence construct as described above 
satisfies at least one of the following criteria: 

a) serum extracted from a Macaque primate which has been immunised by administration of 
35 an expression vector containing the nucleotide sequence construct is capable of eliminating 
SHIV as determined by quantitative PGR and/or virus culturing. 
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b) serum extracted from a primate which has been immunised by administration of an 
expression vector containing the nucleotide sequence construct is capable of neutralising 
HIV-1 BX08 and /or other HIV-1 strains in vitro. 

c) serum, extracted from a mouse which has been immunised by administration of an 

5 expression vector containing the nucleotide sequence construct four times in Intervals of 
three weeks and boosted after 15 weeks, is capable of decreasing the concentration of HIV- 
antigen in a culture of HIV, serum or PBMCs by at least 50%. An example of such procudure 
is shown in example 9. 

10 In one embodiment of the invention, the nucleotide sequence construct of the invention, is 
used in medicine. That is, it Is used as a vaccine, for the production of a recombinant protein, 
such that the recombinant protein is used as a vaccine, or the nucleotide sequence construct 
or the recombinant protein is used in a diagnostic composition. 

Thus, the nucleotide sequence construct of the invention can be used for the manufacture of 
15 a vaccine for the prophylactics of infection with HIV in humans. 

Intramuscular inoculation of nucleotide constructs, i.e. DNA plasmids encoding proteins have 
been shown to result in the generation of the encoded protein In situ In muscle cells and 
dendritic cells. By using cDNA plasmids encoding viral proteins, both antibody and CTL 

20 responses were generated, providing homologous and heterologous protection against 
subsequent challenge with either the homologous or cross-strain reaction, respectively. 
The standard techniques of molecular biology for preparing and purifying DNA constructs 
enable the preparation of the DNA therapeutics of this invention. While standard techniques 
of molecular biology are therefore sufficient for the production of the products of this 

25 Invention, the specific constructs disclosed herein provide novel therapeutics which can 
produce cross-strain protection, a result heretofore unattainable with standard Inactivated 
whole virus or subunit protein vaccines. 

The amount of expressible DNA to be Introduced to a vaccine recipient will depend on the 
strength of the transcription and translation promoters used In the DNA construct, and on the 

30 Immunogenicity of the expressed gene product. In general, an immunologically or 

prophylactically effective dose of about 10 ing to 300 ^g Is administered directly Into muscle 
tissue. Subcutaneous injection, intradermal Introduction, impression through the skin, 
inoculation by gene gun preferably DNA coated gold particles, and other modes of 
administration such as intraperitoneal, intravenous, peroral, topic, vaginal, rectal, intranasal 

35 or by inhalation delivery are also contemplated. It Is also contemplated that booster 
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vaccinations are to be provided. It is further contemplated that booster vaccinations with 
recombinant antigens are to be provided, administered as described above. 

The DNA may be naked, that is, unassociated v^^ith any proteins, adjuvants or other agents 
5 which impact on the recipients immune system. In this case, it is desirable for the DNA to be 
in a physiologically acceptable solution, such as, but not limited to, sterile saline or sterile 
buffered saline. Alternatively, the DNA may be associated with surfactants, liposomes, such 
as lecithin liposomes or other liposomes, such as ISCOMs, known in the art, as a DNA- 
liposome mixture, (see for example WO93/24640) or the DNA may be associated with and 
10 adjuvant known in the art to boost immune responses, such as a protein or other carrier. 
Agents which assist in the cellular uptake of DNA, such as, but not limited to, calcium ions, 
detergents, viral proteins and other transfection facilitating agents may also be used to 
advantage. These agents are generally referred to as transfection facilitating agents and as 
pharmaceutically acceptable carriers. 

15 

Those skilled in the field of molecular biology will understand that any of a wide variety of 
expression systems may be used. A wide range of suitable mammalian cells are available 
from a wide range of sources (e.g. the American Type Culture Collection, Rockland, Dm; 
also, see e.g. Ausubel et al. 1992). The method of transfomnation or transfection and the 
20 choice of expression vehicle will depend on the host system selected. Transfomnation and 
transfection methods are described e.g. in Ausubel et al 1992; expression vehicles may be 
chosen from those provided e.g. in P.H. Pouwels et al. 1985. 

In one embodiment of the present invention the protein encoded by the nucleotide sequence 
25 construct is produced by introduction into a suitable mammalian cell to create a stably- 
transfected mammalian cell line capable of producing the recombinant protein. A number of 
vectors suitable for stable transfection of mammalian cells are available to the public e.g. in 
Cloning Vectors: A Laboratory manual (P.H. Pouwels et al. 1985); methods for constructing 
such cell lines are also publicly available, e.g. in Ausubel et al. 1992. 

30 

Standard reference works describing the general principles of recombinant DNA technology 
include Watson, J.D. et al 1987; Darnell, J.E. et al 1986; Old, R.W. et al, 1981; Maniatis.T. et 
al 1989; and Ausubel et al.1992. 
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Figure legends 

The invention is further illustrated in the following non-limiting examples and the drawing 
wherein 

5 Figure 1 provides the codon preference of highly expressed proteins in human cells. 

Figure 2 illustrates the outline of gp120, gp140, gp150, and gp160 encoding synthetic genes 
derived from the wild type sequence at the top. Variable (V) and constant (C) regions are 
shown together with the leader peptide (LP) and the transmembrane spanning domain 
10 (TMD). The approximate nucleotide positions of the restriction enzyme sites are shown. 
The approximate position of the three restriction enzyme sites dividing the full-length 
gp160 gene into the three pieces each containing only unique restriction enzyme sites are 
shown in bold. 

15 Figure 3 building of the synthetic gp120 gene. Variable (V) and constant (C) regions are 

shown together with the leader peptide (LP) and the transmembrane spanning domain ( 
TMD). The approximate nucleotide positions of the restriction enzyme sites are shown. 

Figure 4 building of the synthetic gp140 gene. Variable (V) and constant (C) regions are 
20 shown together with the leader peptide (LP) and the transmembrane spanning domain ( 
TMD). The approximate nucleotide positions of the restriction enzyme sites are shown. 

Figure 5 building of the synthetic gp150 gene. Variable (V) and constant (C) regions are 
shown together with the leader peptide (LP) and the transmembrane spanning domain ( 
25 TMD). The approximate nucleotide positions of the restriction enzyme sites are shown. 

Figure 6 building of the synthetic gp160 gene. Variable (V) and constant (C) regions are 
shown together with the leader peptide (LP) and the transmembrane spanning domain ( 
TMD). The approximate nucleotide positions of the restriction enzyme sites are shown. 

30 

Figure 7 illustrates the codons coding amino acids in general 
Figure 8 illustrates how overlapping PGR is performed. 
35 Figure 9 illustrates how PGR using conserved flanking ends is performed. 
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Figure 10 illustrates how Si265Xhoi is produced using complementary strands (minigene- 
approach) technology. The Siaesxhoi is ligated from three sets of complementary strands 
into the vector pBluescript KS* between restriction enzyme sites Xho\ and Pst\. 

5 

Figure 1 1 illustrates how Si465Psti is produced. The same approach, as the approach used for 
the production of Si265Xhoi, was used except that only two sets of complementary strands 
were used. 

10 Figure 12 illustrates the assembly of Pi. The So-N iang and SassEcoRv are ligated into the Xba\ 
and Pstl site of the S375psti containing plasmid. 

Figure 13 illustrates the assembly of P2. The Sgooxbai was excerted by HindlW and Sad from 
its plasmid and ligated with Sggosaci (Sacl-Spel) into the Snospai plasmid that was opened 
15 at the HindWl and Spel sites. 

Figure 14 illustrates the assembly of P3. 84950131 (C/al-EcoRI) and S650.720EC0R1 (EcoRI-Xftal) 
and P2 (Xba\-Xhol) were ligated simultaneously into the P^ plasmid opened at the C/al 
and Xhol sites to obtain the P3 plasmid. 

20 

Figure 15 illustrates the assembly of P4gpi6o. SiaooHindm (Sacl-H/nc/ljl) and Snooeagi {HindlW- 
Eag\) were ligated simultaneously into the Sieaoxhai plasmid opened by Sacll and Eagl. 

Figure 16 illustrates the assembly of P5. S2i9ociai (C/al-Psfl) and S2330Psti {Pstl-EcoRl) were 
25 ligated into the S2425ES plasmid opened by C/al and EcoRI. 

Figure 17 illustrates the assembly of Psgpieo. Si465Psti (Xbal-Pstl) and Si265Xhoi {Pst\-Xho\) were 
ligated into the P4gpi6o plasmid opened by Xba\ and Xho\. 

30 Figure 18 illustrates the assembly of Psgpiso- Si465Psti (Xbal-Pstl) and Si265Xhoi {Pstl-Xhol) were 
ligated into the plasmid containing P4gpi5o with the stop codon. P4gpi5o plasmid was opened 
at the Xba\ and Xhol sites for the ligation. 

Figure 19 illustrates the assembly of P8gpi4o. Si465Psti (Xbal-Pstl) and Si265Xhoi (Pstl-Xhol) were 
35 ligated into the plasmid containing P4gpi4o with a stop codon. P4gpi4o plasmid was opened 
at the Xba\ and Xhol sites for the ligation. 
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Figure 20 illustrates the assembly of P8gp4i' Two complementary nucleotide strands 
1265gp41S and 1265gp41AS designed with overhang creating a 5' Xhol and a 3' Pst\ 
restriction enzyme site were aneaied and ligated into the piece 8 which is already opened 
5 at the Xho\ and Fsfl sites whereby Si 265 is deleted. 

Figure 21 illustrates the assembly of P7. Ps (XA7ol-Sacll) and Saoeosacn (Sacll-C/al) were 
ligated into P5 plasmid opened at Xho\ and C/al. 

10 Figure 22a SDS PAGE of ^^S-labelled HIV-1 BX08 envelope glycoproteins radio-immuno 

precipitated from transiently transfected 293 cells using the indicated plasmids. Cell pellet 
(membrane bound antigens) or cell supernatant (secreted antigens) were precipitated by 
a polyclonal anti-HIV-1 antibody pool. Lane 1: untransfected cells. Lane 2: supernatant 
from syn.gp120MN transfected cells. Lane 3: cell pellet from wt.gp160Bxo8 transfected cells. 

15 Lane 4: cell pellet from cells co-transfected by wt.gpieOexos and pRev. Lane 5: Mwt. 
marker. Lane 6: cell pellet from syn.gpieOsxos transfected 293 cells. Lane 7: cell pellet 
from syn.gp150Bxo8 transfected 293 cells. Lane 8: supernatant from syn.gp140Bxo8 
transfected cells. Lane 9: supernatant from syn.gp120Bxo8 transfected cells. 

20 Figure 22b is an SDS-PAGE of ^^S-labeled HIV-1 BX08 envelope glycoproteins radio- 
immune precipitated from transiently transfected 293 cells as cell pellet (membrane 
bound) or cell supernatant (secreted antigens) by anti-HIV-1 antibody pool using the 
indicated plasmids. Lane 1: untransfected 293 cells. Lane 2: cell pellet from syn.gp160MN 
transfected 293 cells as positive control (Vinner et al 1999). Lane 3: Cell supernatant from 

25 syn.gp120MN transfected 293 cells as positive control (Vinner et al 1999). Lane 4: Cell 
supernatant from syn.gp120BX08 transfected 293 cells demonstrating a glycoprotein 
band of 120 kDa. Lane 5: Cell supernatant from syn.gp140BX08 transfected 293 cells 
demonstrating a glycoprotein band of 120 kDa. Lane 6: Mwt. marker. Lane7 at two 
different exposure times: Cell pellet from syn.gp150BX08 transfected 293 cells 

30 demonstrating a glycoprotein band of 120 kDa (lower gp30 band is not well seen in this 
exposure). Lane 8: Cell supernatant from syn.gp150BX08 transfected 293 cells showing 
no secreted proteins (all protein is membrane bound, see lane 7), 

Figure 22c show fluorescent microscopy of U87.CD4.CCR5 cells transfected with BX08 
35 gp160 genes plus pGFP. Panel A: cells transfected with empty WRG7079 vector plus 
pGFP showing no syncytia. Panel B: cells transfected with wild type BX08gp160 gene 
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plus pGFP showing some syncytia. Panel C: cells transfected with synBX08gp160 plus 
pGFP showing extreeme degree of syncytia formation. This demonstrates expression, 
functionality, and tropism of the expressed BX08 glycoprotein with much more expressed 
functionally active gp160 from the synthetic BX08 gene. 

5 

Figure 23 shows the anti-Env-V3 BX08 antibody titers (lgG1). Panels show individual mice 
DNA immunized with syn,gp140BX08 plasmid either i.m. (left panel) or by gene gun (right 
panel), respectively. Immunization time points are indicated by arrows. 



10 Figure 24 shows a Western Blotting of (from left to right) one control strip, followed by sere 
(1:50) from 2 mice i.m. immunized with synBX08gp120, 2 mice i.m. immunized with 
synBX08gp140, 2 mice i.m. Immunized with synBX08gp150, and 2 mice immunized with 
synBX08gp160, followed by 2 mice gene gun immunized with synBX08gp120, 2 mice 
gene gun immunized with synBX08gp140, 2 mice gene gun immunized with syn 

15 BX08gp150, and 2 mice gene gun immunized with synBX08gp160 respectively. Strip 5 is 
a mouse 5.1 DNA immunized i.m. with synBX08gp140 plasmid (same mouse as in figure 
23). Plasma was examined at week 18. The positing of gp160 (spiked with four coupled 
gp51), gp120 and gp41 is indicated at the right. A positive reaction to HIV glycoproteins 
futher demonstrates the mouse anti-HIV immunoglobulin reacting to HIV of a strain (IIIB) 

20 different from BX08 to illustrate cross-strain reactivity. 

Figure 25 Theoretical example of calculation of the 50% inhibitory concentration (IC50) 

values. ICso for each mouse serum is determined by interpolation from the plots of percent 
inhibition versus the dilution of serum. 

25 

Figure 26 CTL responses were measured at week 18 to the mouse H-2D^ restricted BX08 
V3 CTL epitope (IGPGGRAFYTT) for BALB/c mice (H-2D*^) i.m. immunized at week 0, 9, 
and 15 with the synthetic vaccine genes: syn.gp120Bxo8, syn.gpMOexos, syn.gp150Bxo8, 
and syn.gp160Bxo8. respectively, and median values of different E:T ratios for groups of 
30 mice are shown (26A). Intramuscular DNA immunization with syn.gp150Bxo8 induced a 
higher CTL reponse when injected i.m. in high amounts versus gene gun inoculation of 
skin (26B). 



35 



Figure 27 Summary of western immune blotting assay of mice sera (1:40) collected at week 
0, 9, and 18 from mice genetically immunized with syn.gp120Bxo8, syn.gpMOsxos, 
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syn.gp150Bxo8, syn.gp160Bxo8, wt.gp160Bxo8, and wt-gplSOexos plus pRev, respectively. 
Percent responders in groups of 17-25 mice against gp120 and gp41 are shown. 

Figure 28 IgG anti-rgp120 (NIB) antibody titers of individual mice inoculated at week 0, 9, 15 
(28A), or gene gun immunized at week 3, 6, 9, and 15 (28B) witfi tfie syn.gp150Bxo8 DNA 
vaccine. 

Figure 29 IgG antibody titers to HIV-1 rgp120||,B. Median titers are sliown from groups of 
mice i.m. inoculated at week 0, 9, and 15 (29A), or gene gun immunized at week 0, 3, 6, 
9, and 15 (298) with the synthetic genes syn.gp120Bxo8, syn.gp140Bxo8, syn.gp150Bxo8, 
and syn.gp160Bxo8, respectively. 
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Examples 

Example 1: Designing the nucleotide sequence construct 

Initially the overall layout of the nucleotide sequence construct is decided. The overall layout 
comprises the various derivatives the gene will be expressed as. For BX08 these include, but 
5 are not restricted to gp160, gp150, gp140. gp120, and gp41 . 

Next, the vehicle of expression (plasmid or virus) is to be determined: Preparation for a 
suitable vector determines both need for leader sequence, terminal restriction enzyme sites 
and whether or not an N- or C-terminal protein tag is to be considered (Poly-his, Myc- 
antibody-epitop, etc.). For BX08 a plasmid expression vehicle was chosen. All native wild 

10 type HIV codons are systematically exchanged with the codons most frequently represented 
in a pool of highly expressed human genes (figure 1). By this exchange the amino acid 
sequence is conserved while the nucleotide sequence is dramatically altered. Thus, gene 
structures like overlapping reading frames (e.g. vpu, rev, and tat) or secondary structures 
(e.g. RRE) are most likely destroyed whereas protein cleavage sites, and glycosylation sites 

15 are maintained. The 100% amino acid identity between wtBXOS and synthetic 8X08 in the 
present examples should be calculated after the initial Aia-Ser amino acid sequence, as that 
sequence is a part of the 6 amino acid sequence long Nhe\ restriction enzyme site. 

Depending on the restriction enzyme sites located in the expression vector it is decided 
20 which restriction enzyme sites can be present (tolerated) throughout the finished gene 

construct. The terminal restriction enzyme sites of the synthetic gene must remain unique to 
enable cloning into the vector chosen for expression. General requirements for restriction 
enzyme sites of choice: Preferably creating cohesive ends facilitating ligation, creating no 
compatible ends with adjacent restriction enzyme sites (e.g. BamHI/Sg/ll), and being efficient 
25 cutters. For BX08 the restriction enzyme sites accepted were the ones present in the 
polylinker of the pBluescript cloning vectors (EagI, Mlu\, EcoRV, Psfl, C/al, EcoRI, Xbal, 
SacI, Spel, X/70I, Hind\\\, Sacll, A/ott, BamHI, Smal, Sa/I, Dral, Kpn\ with the exception of 
BglW and Nhe\), This was decided to satisfy the original cloning strategy using individual 
cloning of snuts in pBluescript with restriction enzyme cleaved (trimmed) ends after PGR 
30 amplification, which is not necessary when blunt-end cloning and assembling of 

complementary oligonucleotides are employed. All locations at which the selected restriction 
enzyme sites can be introduced by silent mutations (keeping 100% loyal to the amino acid 
sequence) are identified using the SILMUT software or equivalent. 
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From these possible restriction enzyme sites, a selection of restriction enzyme sites are 
introduced by silent nucleotide substitutions around functional regions of choice of the 
corresponding gene (e.g. RRE) or gene products (e.g. variable region 1 (V1), V2, V3, CD4 
binding area, transmembrane domain, and regions of immunological significans, etc.). 
5 Restriction enzyme sites are located at terminal positions of subcloned snuts (building 

entities) but additional restriction enzyme sites may be present within subunits. For BX08 the 
construct was initially to be cloned in the WRG7079 vector containing a tPA-leader 
sequence. Cloning sites were 5'-Nhe\ SamHI-3'. The entire humanised BX08 sequence 
was divided into thirds: 5'-Nhe\ Xho\ SacW SamHI-S'. These sites were chosen in 
10 this particular order because it resembles the polylinker of pBluescript (KS ) enabling 
successive ligations of the assembled thirds in this cloning vector. Within these thirds 
restriction enzyme sites were kept unique. Next, restriction enzyme sites were placed to flank 
the functional regions chosen as follows: 

A. (5W1): EcoRV-235: Between C1 and V1 . Alternatives: 3xH/A7Qflll (already excluded 
15 because exclusive use at position 1890) or EcoRV. 

B. (VI -3'): Only alternative Pst\ 375. 

C. (5 -V2): as B. 

D. (V2-3'): Alternatives: Spel, C/al 495. C/al chosen because it is closer to V2. 

E. EcoRI 650 placed because next possible site was too far away. 

20 F. (5'-V3): BglW 720 was the alternative closest to the V3 region and further more unique. 

G. (V3-3'): Alternatives Xho\ (excluded) and Xbal 900 located very close to the V3 loop. 

H. (5'-V4) Sad 990: alternatively EcoRI or BamHI (both excluded) 

I. (V4-3*): Alternatives Spel 1110, Kpnl 1145, Pstl 1135. Pst\ already used, Spel chosen 
because of distance to previous site (Sad 990). 

25 J. (5'-V5): Xho\ initially determined. 

K. (Fusion peptide-3*) Pst\ 1465 was the closest alternative to Xho\ 1265. 

L. (5'-lmmunodominant region): Xba\ 1630 chosen among EcoRV (blunt end), Pstl and Xho\ 

(both already used). 

M. (Immunodominant region-3*): EagI 1700 perfect location. 
30 N. (C34 and C43 -3' (Chan, Fass, et al. 1997), and 5'-trans membrane domain): Sadl. No 
alternatives. 

O. (Trans membrane domain -3'): Sadl 2060 already present. 
P. C/al 2190 perfect position in relation to previous RE-site. 
Q: Pstl perfect position in relation to previous RE-site. 
35 R. EcoRI 2400 introduced to facilitate later substitution of terminal snut. 
S. SamHI 2454 determined by the WRG7079 vector. 
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Remove undesired restriction enzyme sites by nucleotide substitutions (keeping loyal to the 
amino acid sequence). Nucleotide substitution should preferably create codon frequently 
used in highly expressed human genes (figure 1). If that is not possible, the codons should 
5 be the selected from the regular codons (figure 7). The substitutions made to the second 
nucleotide sequence to obtain desired restriction enzyme sites are shown in Table 2. 



Table 2 lists silent nucleotide substitutions in the humanised BX08 envelope 
10 sequence. Substitutions were made to create or delete restriction enzyme sites. 



Position: 


substitution 


Remarks: 


138 


c^g 


creates Mlu 1 site on pos. 134-139 


240 


c^t 


creates EcoRV site on pos. 238-243 


501 


c — > a 


creates Cla 1 site on pos. 501-506 


502 


a ->t 


do 


503 


g ^ c 


do 


504 


c~»g 


do 


657 


c->^a 


creates EcoRI site on pos. 656-661 


660 


c^t 


do 


724 


c a 


creates Bgl II site on pos. 724-729 


726 


c^g 


do 


727 


a->t 


do 


728 


g->c 


do 


729 


c^t 


do 


840 


c-> t 


EagI site is eliminated 


904 


a -»t 


creates Xba 1 site on pos. 904-909 


905 


g 0 


do 


906 


c->t 


do 


907 


c ^ a 


do 


909 


c ^ a 


do 


QQ^ 


a t 


creates oac i site on pos. b^^u-^^o 


996 


g->c 


do 


1116 


c->t 


creates Spel site on pos. 1114-1119 


1119 


c~>t 


do 


1273 


a^t 


creates Xhol site on pos. 1272-1277 


1274 


g -> c 


do 


1275 


c->g 


do 


1293 


c->t 


Bgl II site is eliminated 


1443 


c^t 


BstXI site is eliminated 


1452 


g c 


do 


1467 


c^t 


PstI site on pos. 1466-1471 


1470 


a 


do 


1590 


g->c 


PstI site on pos, 1588-1593 Is eliminated 


1620 


g^c 


PstI site on pos. 1618-1623 is eliminated 


1638 


c->t 


creates Xbal site on pos. 1638-1643 


1641 


g a 


do 


1653 


g->c 


PstI site is eliminated 


1687 


a->t 


PstI site is eliminated 


1688 


g-^c 


PstI site is eliminated 


1710 


c^g 


creates EagI site on pos. 1709-1714 


1758 


c^t 


Bgl II site is eliminated 


1875 


g^c 


PstI site is eliminated 


1893 


c a 


Hind 111 on pos. 1893-1898 


1897 


c->t 


do 


1944 


c->t 


Bgl II site is eliminated 
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Position: 



substitution 



Remarks: 



2199 
2202 
2203 
2253 
2292 
2320 
2321 
2322 
2325 
2430 
2433 



c->a 
c-> a 
c-»t 



c->t 
c-->t 
c-»t 
c^g 
g-^c 
a^t 
a->t 



creates EcoRI site on pos. 2429-2434 
do 



do 
do 
do 



Sacll site Is eliminated 

PstI site is eliminated 

PstI site on pos. 2321-2326 is eliminated 



do 

do 



Cla I site on pos. 2198-2203 



Example 2: synthesis of oiigos 

In order to clone the individual snuts, nucleotide strands were synthesised or purchased. In 
total 28 synthetic nucleotide strands were synthesised. Nucleotide strands were synthesised 
5 by standard 0.2 iiimol p-cyanoethyl-phosphoramidite chemistry on an Applied Biosystems 
DNA synthesiser model 392, employing 2000 A CPG columns (Cruachem, Glasgow, 
Scotland), acetonitrile containing less than 0.001% water (Labscan, Dublin, Ireland) and 
standard DNA-synthesis chemicals from Cruachem. including phosphoramidites at 0.1 M 
and Tetrahydrofuran/N-methylimidazole as cap B solution. The nucleotide strands O-N-C 

10 and 1 19MS-RC (for cloning of snut O-N-Lang), 650-E-BG and 720-XBAC-31 (for cloning of 
snut 650-720-EcoRI), 2425esup and 2425ESdo (for cloning of snut 2425-E-S) were 
synthesised with 5' end "trityl on" and purified on "Oligonucleotide Purification Cartridges" 
(Perkin Elmer, CA, USA) as described by the manufacturer. Other nucleotide strands (235- 
EC05, 375-pstl.seq, 495-Cla1.seq.900-Xbal, 990-sacl, 1110-SPE, 1630-Xba.seq, 1700- 

15 Eag.seq, 17-Eag.seq, 1890-Hind.MPD, 2060-sac, 2190-cla, 2330-pst) were synthesised with 
5' end "trityl off" and purified by standard ethanol precipitation. Oligoes 1265-1 UP, 1265- 
1DO, 1265-2UP, 1265-2DO, 1265-3UP, 1265-3DO, 1465-1UP, 1465-1DO, 1465-2UP, 1465- 
2DO were purchased from Pharmacia. 

Example 3: Cloning of snuts 

20 The nucleotide sequence construct was designed in 17 DNA small pieces called snuts 

(Table 3) encompassing important structures like variable and constant regions each flanked 
with restriction enzyme (RE) sites to facilitate cassette exchange within each third of the 
gene: Nhel-Xhol XholSacll Sacll-BamHI. 

25 Each snut was cloned individually in a commercial vector (pBluescriptKS or pMOSblue) and 
kept as individual DNA plasmids, named after the snut which gives the nucleotide position of 
the RE in the BX08. 
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Table 3 list the snuts by their name and cloning vector. 



Name 


Cloning vector: 


So-N-Lang 


pMOSblue 


S235EC0RV 


pMOSblue 


S375Pstl 


pBluescriptSK 


S495Clal 


pMOSblue 


S650-720ECORI 


pMOSblue 


SgooXbal 


pMOSblue 


SoQOSacI 


pMOSblue 


Smospel 


pMOSblue 


Si265Xhol 


pBluescriptSK 


Si465Pstl 


pBluescriptSK 


Si630Xbal 


pBluescriptSK 


Si700Eagl 


pBluescriptSK 


Si890Hlndlll 


pBluescriptSK 


S2060Sacll 


pMOSblue 


S2190Clal 


pMOSblue 


S2330Pstl 


pMOSblue 


S2425ES 


pBluescriptSK 



Three principally different methods were used to obtain the dsDNA corresponding to each of 
5 the 17 snuts needed to build the synthetic BX08 genes. 

1) "Overlapping" PCR: is based on the use of two ssDNA template nucleotide strands 
(forward and reverse) that complement each other in their 3'-end (figure 8). During the first 
PCR cycle, both templates annealed to each other at the 3 -ends allowing the full-length 
10 polymerisation of each complementary strand during the elongation step. The newly 

polymerized dsDNA strand are then amplified during the following cycles using an adequate 
forward and reverse primers set (figure 8). 

Snut O-N-LANG: (So-N-tang) two ng of the fonvard template nucleotide strands O-N-C and 2 
15 ng of the reward template-nucleotide strand 1 19MS-RC were mixed together with SOpmoles 
of the foHA^ard primer O-N-LANG-5 (5'-CTAGCTA-GCGCGGCCGACCGCCT -3*) and 
SOpmoles of the reverse primer O-N-LANG-3 (5'-CTCGATATCCTCGTGCATCTGCTC -3') in 
a lOO^il PCR reaction volume containing 0.2mM dNTP's, 1x ExpandHF buffer with MgCl2 
(I.SmM) and 2.6 units of enzyme mix (Expand™ High fidelity PCR system from Boehringer 
20 Mannheim). The PCR was performed with the PE Amp 9600 thermocycler (Perkin Elmer) 
using the following cycle conditions: initial denaturation at 94^0 for 30 sec, followed by 30 
cycles of 94'*C for 15 sec, 65°C for 30 sec, 72''C for 45 sec, with a final elongation at 72*'C 
for 5 min,, and cooling to 4*'C. 
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Snut 650-720-EcoRI: (S65o-72oecori) PGR amplification was performed as described for snut 
O-N-LANG. One of the forward ssDNA template-oligonucleotide 650-E-BG and 1 ]xg of 
the reverse ssDNA template-oligonucleotide 720-XBAC were mixed with 40pmoles of the 
foHA/ard primer 650-E-5 (5'-CCGGAATT-CGCCCCGTGGTGAGCA-3') and 40 pmoles of the 
5 reverse primer 720-X-3 (5'-CTGCTCTAGAGATGTTGCAGTGGGCCT-3'). 

2) "Normal" PCR amplification: Eleven nucleotide strands: 235-EC05, 375-pst1, 900-xba1, 
990-sac1, 1110-SPE, 1630-XBA, 1700-EAG, 1890-HIN, 2060-sac, 2190-cla, and 2330-pst, 
were designed with common 5' and 3' flanking sequences which allowed PCR amplification 

10 with the same primer set (FonA/ard primer : BX08-5 (5'-AGCGGATAACAATTTCACACAGGA- 
3') and revers primer : BX08-3 (5'-CGCCAGGGTTTTCCCAGTCACGAC-3') (Figure 9). The 
495-Cla1 oligonucleotide was designed without a common flanking sequence and was 
therefore amplified with a specific set of primers 495-5N/495-3N ( 5 -GAATCGAT- 
CATCACCCAG-3* and 5'-GACGAATrCCGTGGGTGCACT-3'). Each oligonucleotide was 

15 resuspended in 1mi of water and kept as a stock solution (approximatively 0.2 mM). PCR 
amplification was performed with the Expand™ High Fidelity PCR System from Boehringer 
Mannheim (Cat. No. 1759078). Four concentrations of template nucleotide strand were 
systematically used: undiluted stock solution, stock solution 10"\ stock solution 10"^, stock 
solution 10'^. One to 5 \i\ of synthetic ssDNA template was amplified using the following 

20 conditions: BX08-5 (O.SuM). BX08-3 (0.5^M), 4 dNTP's (0.2mM), 1x ExpandHF buffer with 
MgCl2 (1.5mM) and 2.6 units of enzyme mix. The PCR was performed using the PE Amp 
9600 thermocycler (Perkin Elmer) using the following cycle conditions: initial denaturation at 
94**C for 15 sec, followed by 30 cycles of 94°C for 15 sec, 65**C for 30 sec and 72**C for 45 
sec, with a final elongation at 72°C for 7 min., and cooling to 4°C. 

25 

3) Minigene approach : This method was used to synthesise Si265Xhoi, Suesxtai and S2425ES. 
Snut 2425-E-S (S2425ES): 100 picomoles of each oligonucleotide 2425ES-up (35-mer ; 5 - 
AATTCGCCAGGGCTTCGAGCGCGCCCTGCTGTAAG-3') and 2425ES-do (35-mer ; 
GATCCTTACAGCAGGGCGCGCTCGAAGC-CCTGGCG-3') were mixed together in a 100 ^il 

30 final volume of annealing buffer containing NaCI 25mM , Tris lOmM and ImM EDTA. After 
denaturation at 94°C for 15 min., the mixed oligonucleotides were allowed to anneal at 65**C 
during 15min.. The annealing temparature was allowed to slowly decrease from the 65'*C to 
room temperature (22°C) during overnight incubation. The resulting double-strand dsDNA 
fragments harbored EcoRI- and BamHI-restriction sites overhangs that allowed direct cloning 

35 in pBluescript KS(+) vector using standard cloning techniques (Maniatis 1996). 
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Snut 1265-Xhol (Si265Xhoi): This snut was built according to the strategy depicted in figure 10. 
Three minigenes were constructed following the same method described for snut 2425-E-S. 
These minigenes are named 1265-1, 1265-2 and 1265-3. The minigene 1265-1 results from 
the annealing of the oligonucleotides 1 265-1 up (68-mer ; 5'-tcg agc ago ggc aag gag att 
5 rrc cgc ccc ggc ggc ggc gac atgc gcg aca act ggc gca gcg agc t-3') and 1 265-1 do (68- 
mer ; 5-GTA CAG CTC GCT gcg CCA GTT GTC GCG CAT GTC GCC GCC GCC GGG GCG G AAA ATC 

TCC TTG CCG CTG c-3'). 1265-2 results from the annealing of 1265-2up (61-mer ; 5'-gta caa 

GTA CAA GGT GGT GAA GAT CGA GCC CCT GGG CAT CGC CCC CAC CAA GGC CAA GCG C-3') and 
1265-2dO (63-mer ; 5'-CAC GCG GCG CTT GGC CTT GGT GGG ggc gat GCC CAG GGG CTC GAT 

10 CTT CAC CAC CTT GTA CTT-3'). Finally, 1265-3 results from the annealing of 1265-3up (69-mer 

; 5'-CGC GTG GTG CAG CGC GAG AAG CGC GCC GTG GGC ATC GGC GCT ATG TTC CTC GGC TTC CTG 

GGC GCT GCA-3') and 1265-3do (59-mer ; 5'-GCG CCC AGG aag ccg agg aac ata gcg ccg 
ATG CCC ACQ GCG CGC TTC TCG CGC TGC AC-3'). Each minigene were designed in order to 
present single strand overhangs at their 5" and 3'- ends that allow easy ligation and Xhol-PstI 
15 direct cloning into pBlueScript KS+ vector. 

Snut 1465-Pstl (Susspsh): Two minigenes were constructed following the same methode 
described for snut 2425-E-S. These minigenes are named 1465-1 and 1465-2. The minigene 
1465-1 was obtained after annealing of 1465-1 up (90-mer : 5'-GGC AGC ACC ATG GGC GCC 

20 GCC AGC CTG ACC CTG ACC GTG CAG GCC CGC CAG CTG CTG AGC GGC ATC GTG CAG CAG CAG 
AAC AAC CTG CTG-3') and 1 465-1 do (98-mer : 5'-CGC gca GCA GGT TGT TCT GCT GCT GCA CGA 
TGC CGC TCA GCA GCT GGC GGG CCT GCA CGG TCA GGG TCA GGC TGG CGG CGC CCA TGG TGC 

TGC CTG CA-3'), whereas minigene 1465-2 results from the annealing of 1465-2up (78-mer ; 

5-CGC GCC ATC GAG GCC CAG CAG CAC CTG CTC CAG CTGA CCG TGT GGG GCA TCA AGC AGC TCC 
25 AGG CCC GCG TGC TGG CT-3') and 1465-2dO (78-mer ; 5'-CTA GAG CCA GCA CGC GGG CCT GGA 
GCT GCT TGA TGC CCC ACA CGG TCA GCT GGA GCA GGT GCT GCT GGG CCT CGA TGG-3'. Each 

minigene were designed in order to present single strand overhangs at their 5' and 3'- ends 
that allow easy ligation and Pstl-Xbal direct cloning into pBlueScript KS+ vector using 
standard cloning techniques (Maniatis) (see figure 11). 

30 Example 4: assembly of snuts to pieces. 

The snut genes were then assembled into pieces (Table 4) so that unique restriction enzyme 
sites or mutagenesis can be used within each of these. This strategy will require fewer 
assemblings for optimal use of the cassette system. The following piece clones were made 
and kept individually for construction of the synBXOS gp160 gene (Figure 6): 
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Table 4 lists pieces by their name and their snut composition. 



Piece 
name 


snut composition 


vector 


Pi 


So-N-LANG-S235EcoRV-S375Pstl 


pBluescriptSK 


P2 


SgooXbarSggoSacrSi 1 loSpel 


pMOSblue 


Ps 


Pl -S495Cla|-S660*720EcoRr P2 


pBluescriptSK 


P4gp160 


Si 630XbarS 1 700EagrS 1 890Hinlll 


pBluescriptSK 


Ps 


S2190ClarS2330PstrS242SES 


pBluescriptKS 


P7 


P8gp160-S2060SacirP5 


pBluescriptKS 


Psaoieo 


Si265XhorSi465PstrP4aDl 60 


pBluescriptKS 



Piece 1: The building strategy is shown in figure 12. 

Preparation of the insert DNA: Five to 15^g of each plasmid 0-N-LANG-cl7 and 235-EcoRV- 
5 cl5N, respectively, were double-digested by Xbal/EcoRV, and Pstl/EcoRV, according to 
classical RE digestion procedure (Maniatis). The RE digestion products, were agarose gel 
purified according classical method (Maniatis). All RE digests were loaded on a 3% Nusieve 
3:1(FMC), TBE 0.5X agarose gel and submitted to electrophoresis (7 Volts/mm during 2- 
3hours) until optimal fragment separation. The agarose-band containing the DNA fragments 

10 that correspond to the snut's sequence sizes (243-bp for O-N-LANG and 143-bp for 235- 
EcoRV) were excised from the gel. The DNA was extracted from agarose by centrifugation 
20min at 5000g using a spin-X column (Costar cat#8160). Preparation of the vector: The 
snut 375-Pstl kloni was used as plasmid vector. Five )iig were digested with Xbal and Psti . 
Removal of the polylinker Xbal/PstI fragment was performed by classical agarose gel 

15 purification, using a 0.9% Seakem-GTG agarose, TBE 0.5X gel. The linearised plasmid DNA 
was extracted from the agarose by filtration through spin-X column. All purified DNA 
fragments were quantified by spectrophotometry. Ligation: All three DNA fragments O-N- 
LANG (Xbal/EcoRV). 235-EcoRV (Pstl/EcoRV) and 375-Pstl(Xbal/Pstl), were ligated 
together by classical ligation procedure, using an equimolar (vector:insert1:insert2) ratio of 

20 1:1:1. Thus for, 200 ng (0.1 pmole) of Xbal/Pstl-linearised 375-Pstl-cl1 were mixed with 16 
ng of O-N-LANG (Xbal/EcoRV) and 10 ng of 235-EcoRV (Pstl/EcoRV) in a final reaction 
volume of 20^1 of IX ligation buffer containing 10U of T4 DNA ligase (Biolabs, cat#202S). 
The ligation was allowed overnight at IS'^C. Transformation: Competent XLI-Blue bacteria 
(Stratagene cat#200130, transformation efficiency > 5*10®col/jxg) were transformed by 

25 classical heat-chock procedure : 1/1 0th of the pre-chilled ligation reaction was mixed with 
50^1 of competent bacteria. The mixture was allowed to stand in ice during 30 min. Bacteria 
were heat-shocked at 42^C during 45 sec. and then left 2 min. on ice before being 
resuspended in 450 ix\ of SOC medium. Transformed bacteria were incubated 1 hour at 37°C 
under shaking (250rpm) and plated on LB-ampicilin agar plates. The recombinant clones 
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were allowed to grow 16 hours at SZ'^C. Colony screening: 10 to 50 recombinant colonies 
were screened by direct PGR screening according to the protocole described into the 
pMOSBlue blunt-ended cloning kit booklet (RPN 5110, Amersham). Each colony was picked 
and resuspended in 50jLil of water. DNA was freed by a boilling procedure (lOO'^C, 5 
5 min).Ten ^il of bacterial lysate were mixed to 1 ^1 of a lOmM solution of premixed 4 dNTP's , 
1 ^1 of MISreverse primer (5pmoles/^l, 5'-CAGGAAACAGCTATGAC-3'), 1 ^1 of T7 primer 
(5pmoles/^l, 5'-TAATACGACTCACTATAGGG-3'), 5^1 of lOx Expand HF buffer 2 
(Boehringer Mannheim, cat#1 759078), 0.5|liI of Enzyme mix (Boehringer Mannheim, 5U/|xl) 
in a final volume of 50|Lil. DNA amplification was performed with a thermo-cycler PE9600 

10 (Perkin-Elmer) using the following cycling parameters: 94''C, 2min, 35 cycles(94°C, 30sec; 
SO^'C, 15sec; 72°C,30sec); 72X, 5min; 4*^0 hold. Five |il of the PGR products were analysed 
after electrophoresis on a 0.9% SeakemGTG , 0.5xTBE agarose gel. Nucleotide sequence 
confirmation: ds-DNA was purified from minicultures of the selected clones with the JETstar 
mini plasmid purification system ( Genomed Inc.). Sequencing was performed using 

15 M13reverse and T7 primers and with the Big DyeTM Terminator Gycle Sequencing Ready 
reaction kit (Perkin-Elmer, Nonwalk, Gonnecticut, P/N4303152) and the ABI-377 automated 
DNA sequenator (Applied Biosystems, Perkin-Elmer, Norwalk, Gonnecticut). Data were 
processed with the Sequence Navigator and Autoassembler softwares (Applied Biosystems. 
Perkin-Elmer, Norwalk, Gonnecticut). 

20 

Piece 2: The strategy for building that piece is depicted in figure 13. RE digestion, DNA 
fragments purification, ligation as well as direct PGR screening of recombinant colonies were 
performed according the same procedures described above for piecel, except the following: 

- The linearised plasmid 1 1 10-Spel-cl24M1 was used as vector after being digested by 
25 Hindu I and Spel, and agarose gel purified. 

- A 166-bp Hindlll/SacI, obtained from snut 900-Xbal-cI15, as well as a 130-bp Sacl/Spel 
fragment, obtained from snut 990-Sacl-cl14, were agarose gel purified. 

- Equimolar amount (0.1 pmoles) of the three DNA fragments described above were ligated 
in an one step ligation. 

30 - lOO^il of competent SGS1 10 bacteria (Stratagene cat# 200247) were transformed with 
1/1 0th of the ligation products according to the manufactor instruction. 

- Direct colony PGR screening was performed using T7 primer and pMOS-R (5 - 
GTTGTAAAAGGAGGGGGAG-3'). 
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Piece 3: The strategy for building that piece is depicted in figure 14. RE digestion, DNA 
fragments purification, ligation as well as direct PGR screening of recombinant colonies were 
performed according the same procedures described above for piecel, except the following: 

- The plasmid piece1-cl33 was linearised by Clal and Xhol, in order to be used as vector, 
5 and agarose gel purified. 

- A 161-bp Clal/EcoRI fragment, obtained from 495-Clal-cl135M1 as well as a 254-bp 
EcoRI/Xbal fragment, obtained from 650-720-EcoRI-cl39, and a 374-bp Xbal/Xhol fragment, 
obtained from piece2-cl4, were agarose gel purified. 

- Equimolar amount (0.1 pmole) of each of these 4 DNA fragments were mixed and ligated 
10 together. 

- 50^l of competent XLIBlue bacteria were transformed with 1/1 0th of the ligation products 
according to the protocole described for piecel. 

- Direct colony PGR screening was performed using MISReverse and T7 primers. 

15 Piece 4 gp160: The strategy for building that piece is depicted in figure 15. RE digestion, 
DNA fragments purification, ligation as well as direct PGR screening of recombinant colonies 
were performed according the same procedures described above for piecel, except the 
following: 

- The plasmid 1630-Xbal-cl2 was linearised by Sacll/ EagI digestion and agarose gel 
20 purified, in order to be used as vector. 

- A 190-bp Eagl/Hindlll fragment, obtained from snut 1700-Eagl-cl4, as well as a 177-bp 
Sacll/Hindlll fragment, obtained from snut 1890-Hindlll-cl8, were agarose gel purified. 

- Equimolar amount (0.1 pmoles) of the three DNA fragments described above were ligated 
in an one step ligation. 

25 - 50^1 of competent XL1 Blue bacteria were transformed with 1/1 0th of the ligation products 
according to the protocole described for piecel . 

- Direct colony PGR screening was performed using MISReverse and T7 primers. 

Piece 4-gp150: PGR-based site-directed mutagenesis was performed on double-stranded 
30 plasmid-DNA from piece4-cl4 according an adaptation of the ExSite™ PGR-Based Site- 
Directed Mutagenesis Kit procedure (Stratagene cat#200502)(Weiner, M.P., Costa, G.L., 
Schoettlin, W., Gline, J., Marthur, E., and Bauer, J.G. (1994) Gene 151:119-123). The 
mutations introduced are shown in bold letters in the primer sequences below. PGR 
amplification was performed with the Expand™ High Fidelity PGR System (Boehringer 
35 Mannheim, cat#1759078). Briefly, 1.5 ing, 0.5 |j.g or 0.1 ing of circular dsDNA was mixed with 
1.5 pmoles of P4M2S (5'-TGTGGAAGGTGAGGGGGGTGCATGGGTGGG-3') and 1.5 
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pmoles of P4M2AS (5'-CCCGCCTGCCCGTGTGACGGATCCAGCTCC-3') in a final volume 
of SOfil containing 4 dNTPs (250jiM each), 1x Expand HF buffer 2 (Boehringer Mannheim, 
cat#1 759078), 0.75|liI of Enzyme mix (Boehringer Mannheim, 5U/^I). The PGR was 
performed with a PE9600 thermo-cycler (Perkin-Elmer Corporation) under the following 
5 cycling parameters : 94°C, 2min ; 15 cycles (94 ^C, 45 sec ; 68 °C, 4 min); 72^C, 7min and 4 
"^C, hold. PGR products were phenolichloroform extracted and precipitated (Maniatis). 
Plasmid template was removed from PGR products by Dpnl treatment (Biolabs)(Nelson, M., 
and McGlelland, M., 1992) followed by ethanol-precipitation. Amplicons were resuspended in 
50 |Lil steril water, and phosphorylated according the following procedure: 7.5 |al of amplicons 

10 were mixed with 0.5h.I of lOOmM DTT, 1 ^1 of lOx pk buffer and 1 jal of pk mix enzyme 

(pMOSBlue blunt-ended cloning kit, Amersham , cat#RPN 5110). DNA kinasing was allowed 
5min at 22*^C. After heat-inactivation (10min, 75°C) of the pk enzyme. I^il of ligase (4units, 
Amersham , cat#RPN 5110) was directly added to the pk reaction. The ligation was allowed 
overnight at 22X. 50|uil of competent XLIBlue bacteria were transformed with 1/1 0th of the 

15 ligation reaction according to the classical protocol (Maniatis). Insertion of mutations was 
checked by sequencing. 

Piece 4-gp140: PCR-based site-directed mutagenesis was performed on piece 4-cl4, 
according to the procedure described for piece4-gp150 except that the primers P4M1AS (5 - 
20 TGTGTGACTGATTGAGGATCCCCAACTGGC-30 and P4S (5'- 
AGCTTGCCCACTTGTCCAGCTGGAGCAGGT-3') were used. 

Snut 1265-Xhol-gp120 : PCR-based site-directed mutagenesis was performed on plasmid 
1265-Xhol-cl2M1 according to the procedure described for piece4-gp150 except that the 
25 primers 1265MAS (5-CTTCTCGCGCTGCAGGACGGGGCGGTTGGC-3*) and 1265M2S (5'- 
CGCGCCTAGGGCATCGGCGCTATGTTCCTG-3') were used. 

Snut 1265-Xhol-gp160/uncleaved : PCR-based site-directed mutagenesis was performed 
on plasmid 1265-Xhol-cl2M1 according to the procedure described for piece4-gp150 except 
30 that the primers 1265MAS (5'-CTTCTCGCGCTGCACCACGCGGCGCTTGGC-3') and 
1265M2S (5 -AGCGCCGTGGGCATCGGCGCTATGTTCCTC-3*) were used. 

Snut 1465-Pstl-CCG : PCR-based site-directed mutagenesis was performed on plasmid 
1465-Pstl-cl25 according to the procedure described for piece4-gp150 except that the 
35 primers 1465MAS (5'-CTGCTTGATGGCCCACACGGTCAGCTG-3') and 1465MS (5'- 
TGCTGCGGCCGCGTGCTGGCTCTAGA-3') were used. 
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Piece 5 : The strategy for building that piece is depicted in figure 16. RE digestion, DNA 
fragments purification, ligation as well as direct PGR screening of recombinant colonies were 
performed according the same procedures described above for piecel, except the following: 
5 - The plasmid 2425-ES-cl2 was linearised by Clal/EcoRI digestion and agarose gel purified, 
in order to be used as vector. 

- A 129-bp Pstl/Clal fragment, obtained 2190-Clal-cl6M15, as well as a 114"bp Pstl/Ecorl 
fragment, obtained from 2330-Pstl-cl8, were agarose gel purified. 

- Equimolar amount (0.1 pmoles) of the three DNA fragments described above were ligated 
10 in an one step ligation. 

- 50|al of competent XLIBlue bacteria were transformed with 1/1 0th of the ligation products 
according to the protocole described for piecel . 

- Direct colony PGR screening was performed using T3 (5'-ATTAAGGGTGAGTAAAG-3') and 
T7 primers. 

15 

piece 8 : The strategy for building that piece is depicted in figure 17. RE digestion, DNA 
fragments purification, ligation as well as direct PGR screening of recombinant colonies were 
performed according the same procedures described above for piecel, except the following: 

- The plasmid piece4-cl4 was linearised by Xbal/Xhol and agarose gel purified, in order to be 
20 used as vector. 

- A 200-bp Xhol/PstI fragment, obtained from 1265-Xhol-cl2M1 as well as a 178-bp Pstl/Xbal 
fragment, obtained from 1465-Pstl-cl25 were agarose gel purified. 

- Equimolar amount (0.1 pmole) of these 3 DNA fragments were mixed and ligated together. 

- 50|al of competent XL1 Blue bacteria were transformed with 1/1 0th of the ligation products 
25 according to the protocole described for piecel . 

- Direct colony PGR screening was performed using T3 and T7 primers. 

piece 8-gp150 : The strategy for building that piece was identical to that of piece 8, except 
that piece4-cl4M3 was used as vector (figure 18). 

30 

piece8-gp150/ uncleaved : The strategy for building that piece is identical to that of piece 8, 
except that piece4 gp160-cl4M3 is used as vector and a 200-bp Xhol/PstI fragment, obtained 
from snut 1265-Xhol-gp160/uncleaved as well as a 178-bp Pstl/Xbal fragment, obtained from 
snut 1465-Pstl-GGG are used like inserts. 
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piece 8-gp140 : The strategy for building that piece was identical to that of piece 8, except 
that piece4-cl4M5 was used as vector (figurelQ). 

piece8-gp140/ uncleaved : The strategy for building that piece is identical to that of piece 8, 
5 except that piece4-cl4M5 is used as vector and a 200-bp Xhol/PstI fragment, obtained from 
snut 1265-Xhol-gp160/uncleaved as well as a 178-bp Pstl/Xbal fragment, obtained from snut 
1465-Pstl-CCG are used like inserts. 

Piece8-gp41 : The strategy for building that piece is depicted in figure 20. RE digestion, 
10 DNA fragments purification, ligation as well as direct PGR screening of recombinant colonies 
were performed according the same procedures described for synBXOS gp160 gene. A 63 
bp linker is to be made according to the method described for snut 2425-ES, the minigene 
appraoch. Thus for 2 complementary oligonucleotides: 1265-gp41S(5- 
TCGAGgctagcGCCGTGGGCATCGGCGCTATGTTCCTCGGCTTCCTGGGCGctgca-3') and 
15 1265-gp41AS (S^-gCGCCCAGGAAGCCGAGGAAC- 

ATAGCGCCGATGCCCACGGCgctagcC-3' should be annealed together. This synthetic 
linker will be directly ligated into the Xho\ /Pst\ sites of piece8-klon13 from which the snut 
1265-Xhol-cl 2M1 would have been removed. 

20 piece? : The strategy for building that piece is depicted in figure 21 . RE digestion, DNA 

fragments purification, ligation as well as direct PGR screening of recombinant colonies were 
performed according the same procedures described above for piecel, except the following: 

- The plasmid piece5-cl1 was linearised by Glal/Xhol and agarose gel purified, in order to be 
used as vector,. 

25 « A 798-bp Xhol/Sacll fragment, obtained from piece8-cl13 as well as a 140-bp Sacll/Glal 
fragment, obtained from 2060-Sacll-cl21 were agarose gel purified. 

- The ligation of the 3 fragments was performed using a vectoninsert ratio of 1:1, 1:2 or 1:5. 

- 50|al of competent XL1 Blue bacteria were transformed with 1/1 0th of the ligation products 
according to the protocole described for piecel. 

30 - Direct colony PGR screening was performed using M13Reverse and T7 primers. 

Example 5: assembly of genes 

synBXOS gp160 gene : The strategy for building that gene is depicted in figure 6. RE 
digestion, DNA fragments purification, ligation as well as direct PGR screening of 
recombinant colonies were performed according the same procedures described for piecel. 
35 20 of the expression plasmid WRG7079 were digested by Nhel/BamHI. Plasmid DNA- 
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ends were dephosphorylated by Calf Intestin Phosphatase treatment (CIP, Biolabs) 
(Maniatis) to avoid autoligation of any partially digested vector. CIP enzyme was heat- 
inactivated and removed by classical phenol-chloroforme extraction. A 1277-bp Nhel/Xhol 
fragment, obtained from piece3-cl27, as well as a 1194'hp Xhol/BamHl fragment, obtained 
5 from piece7-cl1, were agarose gel purified. The ligation was performed using a vectorinsert 
ratio of 1:1 or 1:2. Fifty |il of competent XLIBlue bacteria were transformed with 1/1 0th of the 
ligation product according to the protocole described for piecel . After transformation bacteria 
were plated on LB-kanamycin agar plates. Direct PCR colony screening was performed 
using the primer set WRG-F (5*-AGACATAATAGCTGACAGAC-3') and WRG-R (5'- 
10 GATTGTATTTCTGTCCCTCAC-3'). The nucleotide sequence was determined according the 
methods described above for piecel. 

synBXOS gplSO gene : The strategy for building that gene is depicted in figure 5. RE 
digestion, DNA fragments purification, ligation as well as direct PCR screening of 

15 recombinant colonies were performed according the same procedures described for 

synBXOS gp160 gene. A 1277-bp Nhel/Xhol fragment, obtained from piece3-cI27, as well as 
a 800-bp Xhol/BamHl fragment, obtained from piece 8-gp150-cl26, were agarose gel purified 
and then ligated into the Nhel/BamHI WRG7079 sites. The ligation was performed using a 
vectorinsert ratio of 1:1 or 1:2. 

20 For construction of the synthetic BX08 gpl 50, piece4 was mutated to Piece4gp1 50 whereby 
a tyrosine -> cysteine was changed and a stop codon was introduced after the 
transmembrane spanning domaine ( TMD), followed by a BamHI cloning site. A new 
piece8gp150 was constructed composed of snut1265/snut1465/piece4gp150. 

25 synBXOS gp150/uncleaved gene : RE digestion, DNA fragments purification, ligation as 
well as direct PCR screening of recombinant colonies are performed according the same 
procedures described for synBXOS gpl 60 gene. A 1277-bp Nhel/Xhol fragment, obtained 
from piece3-cl27, as well as a 800-bp Xhol/BamHl fragment, obtained from piece 8- 
gp150/uncleaved, are agarose gel purified and then are ligated into the Nhel/BamHI 

30 WRG7079 sites. The ligation was performed using a vector:insert ratio of 1:1 or 1:2. 

synBXOS gp 140 gene : The strategy for building that gene is depicted in figure 4. RE 
digestion, DNA fragments purification, ligation as well as direct PCR screening of 
recombinant colonies were performed according the same procedures described for 
35 synBX08 gpl 60 gene. A 1277-bp Nhel/Xhol fragment, obtained from piece3-cl27, as well as 
a 647-bp Xhol/BamHl fragment, obtained from piece 8-gp140-cl2, were agarose gel purified 
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and then ligated into the Nhel/BamHI sites of WRG7079. The ligation was performed using a 
vector:insert ratio of 1:1 or 1:2. For construction of the synthetic BX08 gp140, piece4 was 
mutated to Piece4gp140 whereby a stop codon was introduced just before the TMD followed 
by a BamHI cloning site. A new piece8gp140 was constructed composed of 
5 snut1265/1465/piece4gp140. 

synBXOS gp140/uncleaved gene : RE digestion, DNA fragments purification, ligation as 
well as direct PGR screening of recombinant colonies are performed according the same 
procedures described for synBXOS gp160 gene. A 1277-bp Nhel/Xhol fragment, obtained 
10 from piece3-cl27, as well as a 800-bp Xhol/BamHI fragment, obtained from piece 8- 

gp140/uncleaved, are agarose gel purified and then ligated into the Nhel/BamHI WRG7079 
sites. The ligation was performed using a vectorinsert ratio of 1:1 or 1:2. 

synBXOS gp 120 gene : The strategy for building that piece is depicted in figure 3. RE 
15 digestion, DNA fragments purification, ligation as well as direct PGR screening of 
recombinant colonies were performed according the same procedures described for 
synBXOS gp160 gene. A 1277-bp Nhel/Xhol fragment, obtained from piece3-cl27, as well as 
a 206-bp Xhol/BamHI fragment, obtained from 1265-Xhol-gp120-clM5, were agarose gel 
purified and then ligated into the Nhel/BamHI sites of WRG7079. The ligation was performed 
20 using a vectorinsert ratio of 1:1 or 1:2. For construction of the synthetic BX08 gp120, snut 
1265 was mutated to Si265gpi2o to introduce a stop codon at the gp120/gp41 cleavage site 
followed by a BamHI cloning site. 

The gp160, gp150, gp140, and gp120 genes are cloned (Nhel-BamHI) and maintained in an 
25 eucaryotic expression vectors containing a GMV promoter and a tPA leader, but other 

expression vectors may be chosen based on other criteria e.g. antibiotic resistance selection, 
other leader sequences like GD5 etc, presence or not of immune stimulatory sequences etc. 

SynBXOS gp41 gene : The strategy for building that gene is depicted in figure 20. RE 
30 digestion, DNA fragments purification, ligation as well as direct PGR screening of 
recombinant colonies were performed according the same procedures described for 
synBXOS gp160 gene. Piece 8-gp41 is ligated with snut 2060-Sacll-klon21 and piece 5 as 
already decribed for the construction of piece 7, creating piece 7-gp41 (P7gp4i). Subsequently 
the piece 7-gp41 containing the entire gp41 gene will be cloned In WRG7079 using the Nhel 
35 and BamHI sites. 
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Example 6a: High expression by codon optimization 

To analyse the expression of glycoproteins from the wild type and synthetic BX08 envelope 
genes RlPA was performed on transfected mammalian cell lines. Both cell membrane 
associated and secreted HIV-1 glycoproteins from the cell supernatants were assayed. The 
5 envelope plasmids were transfected into the human embryonic kidney cell line 293 (ATCC, 
Rockville, MD) or the mouse P815 (H-2D'*) cell line using calcium phosphate (CellPhect 
Transfection kit, Pharmacia). For radio immune precipitation assay (RlPA), transfected cells 
were incubated overnight, washed twice and incubated for 1 hour with DMEM lacking 
cysteine and methionine (Gibco). Then the medium was replaced with medium containing 50 

10 pCi per ml of [^^S] cysteine and 50 pCi/ml of [^^S] methionine (Amersham Int., Amersham, 
UK) and incubation continued overnight. Cells were centrifuged, washed twice with HBSS 
and lysed in 1 ml ice-cold RlPA buffer (10 mM Tris, pH 7.4, 150 mM NaCI, 50 mM EDTA, 1% 
Nonidet P-40, 0.5% sodiumdeoxycholate) to detect membrane bound Env glycoproteins. The 
cell lysates were centrifuged for 15 min. at 100,000 x g to remove any undissolved particles 

15 and 100 ]x\ immune precipitated with protein A-sepharose coupled human polyclonal IgG 
anti-HIV antibodies (Nielsen et al., 1987). For analysis of secreted Env glycoproteins 500 |li1 
of the 5 ml supernatants from transfected cells were incubated with protein A-sepharose 
coupled human polyclonal IgG anti-HIV antibodies. After washing three times in cold RlPA 
buffer and once in PBS, the immune precipitates were boiled for 4 min. in 0.05 M Tris-HCI, 

20 pH 6.8, 2% SDS, 10% 2-mercaptoethanol, 10% sucrose, 0.01% bromophenol blue and 

subjected to SDS PAGE (Vinner et al., 1999). Electrophoresis was carried out at 80 mV for 1 
hour in the stacking gel containing 10% acrylamide, and at 30 mV for 18 hours in the 
separating gradient gel containing 5-15% acrylamide. Gels were fixed in 30% ethanol-10% 
acetic acid for 1 hour, soaked for 30 min. in En3Hance (Dupont #NEF 981), washed 2x15 

25 min. in distilled water, dried and autoradiography performed on Kodak XAR-5 film. 
Transfection of human 293 cells with the syn.gp120Bxo8 and syn.gp140Bxo8 genes, 
respectively, resulted in high amounts of only secreted HIV-1 glycoproteins (Fig. 22a, lane 9 
and 8). Thus, the synthetic gene in the absence of rev expresses the HIV-1 surface 
glycoprotein of the expected size which is recognised by human anti-HIV-1 antisera. The 

30 expression of BX08 gp120 was Rev independent and with roughly the same high amount of 
gp120 from the syn.gp120MN gene (Fig. 22a, lane 2). Fig. 22a, lane 6 and lane 7 shows the 
expression of only membrane bound gp160 and gp150 from 293 cells transfected with 
syn.gp160Bxo8 and syn.gp150Bxo8 plasmids, respectively. Also transfection with wt.gplGOBxos 
plasmid resulted in a significant expression of membrane bound gp160 despite the absence 

35 of Rev (Fig. 22a, lane 3). Co-transfection with equimolar amounts of Rev encoding plasmid 
seemed to increase this expression somewhat (Fig. 22a, lane 4). This is seen despite the 
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lower transfection effectivity using two plasmids and the use of only half the amount of 
wt.gp160Bxo8 DNA when combined with pRev. The amounts of secreted HIV-1 glycoproteins 
from gp120 and gp141 accumulating in the cell supernatants seemed higher than the 
amounts of cell associated glycoproteins at the time of harvesting of the cells. Interestingly, 
5 the amounts of gp160 produced from the "humanized" gene were about equal to the 

amounts produced by the wt.gp160Bxo8 + pRev genes, respectively (Fig. 22a, lane 4 and 6). 
The processing of gp160, gp150 and gp140 into gp120 plus a gp41, or fractions of gp41, 
produced from wild type or synthetic genes in the 293 cell-line did not function well under 
these experimental conditions. Same phenomenon was seen in RlPA from 293-CD4 cells 
10 and HeLa-CD4 cells infected by HIV-1 mn (Vinner et al., 19999). Because of the absence of 
CCR5 these cell-lines could, however, not be infected by HIV-1 strain BX08. 



Example 6b: Radio immune precipitation assay (RiPA) of synthetic BX08 transfected 
ceils showing expression of glycoproteins from synthetic BX08 env plasmids 

15 The synthetic envelope plasmid DNA were transfected into the human embryonic kidney cell 
line 293 (ATCC, Rockville, MD) using calcium phosphate (CellPhect Transfection kit, 
Pharmacia). For immune precipitation analysis, transfected 293 cells were treated and 
analysed according to the method described in example 6a. 
To analyse expression from these genes, an SDS-PAGE of the ^^S-labelled HIV-1 

20 envelopes, immune precipitated from the transfected cells is shown in figure 22b. Both cell- 
membrane associated and secreted HIV-1 envelope glycoproteins in the cell supernatants 
were assayed. Transfection of 293 cells with the synthetic BX08 gene encoding gp120 
(syn.gp120BX08) in lane 4, and syn.gp140BX08 (lane 5) that did not contain rev' encoding 
regions, resulted in abundant amounts of HIV-1 gp120. Thus, the expressions were Rev 

25 independent and expressed in roughly same high amounts as the syn.gp120MN and 

syn.gpieOMN genes (lane 3 and 2, respectively) already showed by our group and others to 
be markedly increased in comparisson with HIV MN wild type genes including rev (Vinner et 
al 1999). 

Transfection with syn.gp150 plasmid (lanes 7 and 8) resulted in significant expression of 
30 membrane associated gp120 and low detactable amounts of truncated form of gp41 (cell 
pellet in lane 7) with no detectable HIV-1 glycoprotein in the cell supernatant lane 8. It is 
concluded that the synthetic BX08 genes expres the envelope glycoproteins of expected size 
which are recognised by human anti-HIV-1 antiserum. 
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Exampel 6C FACS 

To quantitate the surface expression of HIV glycoproteins from the wild type and synthetic 
BX08 envelope genes transfection experiments were done and cell surface expression 
5 examined by FACS (flow cytometer). 

10 pig of the BX08 envelope plasmid (wild type BX08gp160 or synBX08gp160) plus 10 iiig of 
an irrelevant carrier plasmid pBluescript were used to transfect a 80-90% confluent layer of 
293 cells in tissue culture wells (25 cm^) using the CellPect kit (Pharmacia). After 48 hours 

10 cells were Versene treated, washed and incubated with a mouse monoclonal IgG antibodies 
to HIVgp120 (NEA-9301, NEN™, Life-Science Products Inc., Boston) for time 30 min. on wet 
ice followed by washing in PBS, 3% FCS and incubation with Phyto-Erytrin (PE) labelled rat 
anti-mouse IgGI (Cat #346270, Becton Dickinson) according to the manufacture. After 
washing the cells were fixed in PBS, 1% paraformaldehyd, 3% FCS, and analysed on a 

15 FACS (FACScan, Becton-Dicknsson). Table 5 show in duplicate expression of BX08 gp160 
from 1 1 % of the cells transfected with wild type BX08 (number 1 and 2) compared to the 48 
% of cells expression BX08 glycoprotein when transfected with the synthetic gene (number 3 
and 4). Thus, a several fold higher expression is obtained using the synthetic BX08 gene. 

20 Table 5 FACS analysis of 293 cells transfected with synBX08gp160 (No 1 and 2) and 
wt.gp16+BX08 (No3 and 4) and stained with monoclonal antibodies to surface expressed 
HIV glycoproteins. A higher expression was obtained with the synthetic gene (mean 48%) as 
compared to the wild type gene (mean 1 1 %). 





50 ul 


45 ul 


A 


B 


C 


C-A 


C-B 


1 


syn.gp160BX08 + 


pBluescript SK+ 


2,57 


2,85 


36,91 


34,34 


34,06 


2 


syn.gp160BX08 + 


pBluescript SK-*- 


2,83 


2,14 


58,42 


55,59 


56,28 


3 


wt.gp160BX08 + 


pBluescript SK+ 


1,95 


1,52 


7,51 


5,56 


5,99 


4 


wt.gp1 608X08 + 


pBluescript SK+ 


2,97 


1,42 


14,41 


11,44 


12,99 



A: No primary antibody added (control for unspecific secondary Ab binding) 
25 B: Neither Primary Ab nor Secondary Ab added (autoflouroscense control) 
C: Primary Ab and secondary Ab added. 



Example 6D Analyses of the surface expression and biological functionality 

To analyse the surface expression and biological functionality from the wild type and 
30 synthetic BX08 envelope genes transfection experiments were done and cell fusion 
microscopically studied using HIV envelope receptor expressing cells. 
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10 i^g of the BX08 envelope plasmid (wt.BX08gp160 or syn.BX08gp160 or empty WRG7079 
vector plasmid) plus 5 |Lig of a plasmid (pEGFP, Clonetech) expressing green fluorescent 
protein (GFP) were transfected into 2x10® adherent U87.CD4.CCR5 cells (NIH AIDS Res. & 
Reference program, catalog #4035) stabely expressing CD4 and CCR5, using the CellPhect 
5 transfection kit (Pharmacia). After 48 hours the cells were examined by microscopy and 
photographed (Fig 22c). 

Fig 22c panel A show the negative control (empty WRG7079 plus pGFP) giving no syncytia. 
Panel B show cells transfected with the wild type BX08 gp160 plasmid where cell-to-cell 
fusion (syncytia) is seen. Panel C show cells transfected with the same amounts of 
10 synBX08gp160 plasmid and demonstrating a much higher degree of cell-cell fusion. In fact 
most or all of the cells in the culture plate were fused at this time. This experiment show 
surface expression of functional HIV gp160 with tropism to the CCR5 receptor, as well as a 
much higher expression and biological activity from the synthetic BX08 gene as compared to 
the wild type equivalent. 

15 

Example 7: Gene inoculation of mice for immunization 

6-7 weeks old female BALB/c mice were purchased from Bomholdtgaard, Denmak. 
Microbiological status was conventional and the mice were maintained in groups of 4/5 per 
cage with food and water ad libitum and artificially lighted 12 hours per day. Acclimatization 

20 period was 2 days. Mice were anaesthetized with 0.2 ml i.p. of rohypnolistesolid (1 :3, v/v) 
and DNA inoculated by either i.m. injection of 50 2 mg/ml of plasmid DNA in each tibia 
anterior muscle at week 0, 9, and 15 and terminated week 18; or gene gun inoculated on 
shaved abdominal skin using plasmid coated gold particles (0.95 ^m particles, 2 DNA/mg 
gold, 0.5 mg gold/shot, 50-71% coating efficiency) with the hand held Helios® gene gun 

25 device (BioRad) employing compressed (400 psi) Helium as the particle motive force. Mice 
were gene gun vaccinated at week 0, 3, 6, 9, 15, and terminated week 18. 

Example 8: Serological assays 

Western blotting. The induction of a humoral response to gp120 and gp41 antigens by in 
30 vivo expression of the encoded glycoproteins from the synthetic BX08 genes was examined 
by western immuno blotting (Figure 27). Mouse antisera (1:40) were evaluated in western 
blotting using the commercial HIV BLOT 2.2 strips (Genelabs Diagnostic). The conjugate 
was a 1 :200 dilution of the alkaline phosphatase-conjugated rabbit anti-mouse IgG 
(Dakopatts, Giostrup, Denmark). Buffers, incubation condition and colour development were 
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used according to the manufacturer. In these western blotting strips the gp160 band from 
HIV-1 IIIB contain of an oligomeric form of gp41 in a higher concentration than the 
monomeric gp41 band on the strip (Genelabs Diagnostic). HIV-Imib lysate is used in these 
commercial strips where the gp160 band is composed by addition of tetrameric gp41.AII 
5 preimmune sera tested negative in western blotting. Mice inoculated with syn.gp120Bxo8 
showed antibody response to the heterologous gp120 of HIV-1 IIIB. Inclusion of the 
extracellular part of gp41 in the gene syn.gp140Bxo8 induced antibody reaction to both gp120 
and gp41 in all mice. This confirms the in vivo expression of BX08 gp120 and the 
extracellular part of gp41. DNA vaccination with syn.gpieOsxos encoding the membrane 
10 bound glycoprotein induced antibodies to gp120 and gp41 in 50% and 64% of the mice, 
respectively. DNA vaccination with syn.gplSOBxos induced detectable antibodies to gp120 
and gp41 in 41% and 53%, respectively. Induction of different levels of antibodies could 
explain the difference in numbers of positive reactive mice sera in this qualitative western 
blotting. 

15 ELISA, Mouse anti HIV-1 gp120 antibodies were measured by indirect ELISA. Briefly, wells 
of polystyrene plates Maxisorb (Nunc) were coated for 2 days at room temperature with HIV- 
1 IIIB recombinant gp120 (Intracel) at 0.2 ^ig/lOO jxl of carbonate buffer, pH 9.6. Before use 
the plates were blocked 1 hour at room temperature with 150 p-l/well of washing buffer (PBS, 
0.5 M NaCI, 1% Triton-X-100) plus 2% BSA and 2% skimmilk powder. After 3 x 1 min. 

20 washings, mouse plasma was added at 100 |iil/well diluted in blocking buffer and ELISA 

plates incubated for 90 min. at room temperature using a microtiter plate shaker. As standard 
curve we used a mouse monoclonal antibody to a conserved part of gp120 between V5-C5 
(MRDNWRSELYKY) (#NEA-9301, NEN™ Life Science Products, Inc., Boston, MA). As 
calibration control included on each plate we used a plasma pool from 10 mice vaccinated 

25 with BX08 gp120. Plates were again washed 5x1 min. and incubated 1 hour at room 
temperature with 100 jal/well of HRP-conjugated rabbit anti-mouse IgG (#P260, Dakopatts, 
Glostrup, Denmark) diluted 1:1000 in blocking buffer. Colour was developed with 100 jul/well 
of peroxidase enzyme substrate consisting of 4 mg of o-phenylenediamine in 1 1 ml water 
plus 4 111 hydrogen peroxide (30%, w/w). The enzyme reaction was terminated after 30 min. 

30 by 150 |Lil/well of 1M H2SO4. The optical density (OD) of wells was measured at 492 nm using 
a microplate photometer (Molecular Devices, Biotech-Line, Denmark). Anti-HIV-gp120 IgG 
titers were expressed as the reciprocal plasma dilution resulting in an OD492nm value of 0.500. 
Mouse anti-HIV-1 BX08 antibodies were also measured by indirect peptide ELISAs as 
described above using a BX08 V3 peptide (SIHIGPGRAFYTTGD) (Schafer, Copenhagen, 

35 Denmark). 
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The IgG antibody response to HIV-ImB rgp120 quantitated by ELISA is seen in Fig. 28 
and Fig. 29. No bacicground activity was observed in preimmune sera or in sera from 4 mice 
immunized with empty WRG7079 vector in parallel with the BX08 genes. All mice inoculated 
with the synthetic BX08 genes either by gene gun or by i.m. injection responded and showed 
5 a persistent and high titered (about 100-10,000) IgG response to rgp120 as exemplified in 
Fig. 4. When comparing the median titers for groups of mice (Fig. 29) a moderate antibody 
response was observed with the wt.gplSOBxos. Intramuscular and gene gun immunization 
with a mixture of wt.gpieOexos plasmid plus Rev encoding plasmid did not increase this 
antibody response. This was found even when both plasmids were coated onto the same 

10 gold particles to ensure co-transfection of single target cells. However, to ensure inoculation 
of equal amounts of total DNA only half of the amount of wt.BXOS plasmid was used when 
mixing with pRev which may have contributed to the lower antibody response when pRev 
was included. A 5-fold improvement of the antibody response was obtained using the 
syn.gp160Bxo8 gene. This antibody response seemed further improved using the 

1 5 syn.gpl 50bxo8 gene where the cytoplasmic internalization signals were eliminated but only 
using gene gun inoculation. For both the gene gun inoculation of skin and i.m injection the 
highest antibody titers to rgp120 were induced by genes encoding secreted gp120/gp140 
glycoproteins versus membrane bound gp^50/gp^ 60 glycoproteins, respectively. In general, 
equal antibody and ELISA titers to rgp120 were obtained using gene gun and i.m. injection of 

20 the BX08 vaccine genes. 

Example 9: Neutralization assay 

Mouse plasma was diluted in culture medium (RPMI-1640 medium (Gibco) supplemented 
with antibiotics (Gibco), Nystatin (Gibco) and 10% FCS (Bodinco)) and heat inactivated at 
eO^'C for 30 min. Of the HIV-1 strain BX08 (50 TCID50 per ml propagated in PBMC) 250 |jl 

25 was incubated for 1 hour at room temperature with 250 pi dilution of mouse serum (four five- 
fold dilutions of mouse serum, final dilutions 1:20 to 1:2500). After incubation 1x10^ PBMC 
in 500 Ml culture medium was added to the virus-serum mixture and incubated overnight at 
37X in 5% CO2. Subsequently, eight replicates of 10® PBMC in 200 pi culture medium were 
cultured in 96-well culture plates (Nunc) at 37X in 5% CO2. After seven days in culture the 

30 concentration of HIV antigen in the culture supernatant was quantitated using HIV antigen 
detection ELISA (Nielsen et a!., 1987). 

This ELISA is perfomned using human IgG. purified from high titered patient sera, both as 
capture antibody and biotin-linked as detecting antibody. In brief, anti-HIV-capture IgG diluted 
1:4000 in PBS, 100 ^il/well, are coated onto Immunoplates (Nunc) ovemight at 4**C. After 
35 washing five times in washing buffer 100 ^il of supernatants are applied and incubated ovemight 
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at 4*^0. Plates are washed 5 times before incubation with 100 |al HIV-IgG conjugated with biotin 
diluted 1:1000 in dilution buffer, plus 10% HIV-1 sero-negative human plasma for 3 hours at 
room temperature. Five times 1 min. washing in washing buffer are followed by 30 min. 
incubation with 100 ^il of 1:1000 avidine-peroxidase (Dako P347 diluted in dilution buffer). Six 
5 times 1 min. washings, 5 in washing buffer and the last one are done in dH20 before colour is 
developed with 100 ^1 of peroxidase enzyme substrate consisting of 4 mg of OPD in 1 1 ml 
water plus 4 ^1 hydrogen peroxide (30 %, w/w). The enzyme reaction is terminated after 30 
minutes by additional 150 jul of 1M H2SO4. 

The HIV antigen concentration in cultures, preincubated with mouse serum, was expressed 
10 relatively to cultures without mouse serum (culture medium), and the percentage inhibitions 
of the different dilutions of mouse serum were calculated. The 50% inhibitory concentration 
(IC50) for each mouse serum was determined by interpolation from the plots of percent 
inhibition versus the dilution of serum, and the neutralizing titer of the serum was expressed 
as the reciprocal value of the IC50. In each set-up a human serum pool known to neutralise 
15 other HIV-1 strains was included in the same dilutions as the mouse serum as a calibratin 
control. For assay of neutralization of the heterologous SHIV89.6P the MT-2-cell-killing 
format was used (Crawford et al., 1999). The assay stock of SHIV89.6P was grown in human 
PBMC. 

The neutralizing IC50 antibody titers of plasma pools from 10 mice from each group 
20 were measured at different time points (week 0, 9, and 18). A positive background in some 
preimmune sera and thus in all week 0 serum pools was noted even after dilution and heat 
inactivation that was found earlier to lower this background. In general the neutralizing titers 
to BX08 virus of such serum pools were transient and low ranging from 1:6-1:150 above 
background (data not shown). A possible cross-neutralization reaction to a heterologous, 
25 primary HIV-1 envelope was tested using the SHIV89.6P which is relevant in macaque 
models of AIDS and serum pools from mice DNA immunized i.m. with syn.gp140Bxo8. 
Preimmune serum had a titer of 1:37, which is indicative of a slightly positive background, 
whereas the 18 week p.i. serum had a positive neutralizing titer of 1:254 above background. 

30 Example 10: CTL assay 

The cellular immune response in mice following gene gun or i.m. genetic immunization with 
the different vaccine plasmids were examined (Fig. 26). Spleen was removed aseptically and 
gently homogenised to single cell suspension, washed 3 times in RPMI-1640 supplemented 
with 10% FCS and resuspended to a final concentration of 5 x 10^ cell/ml. The cells were 
35 then incubated 5 days with mitomycin-C treated (50 |ag/ml for 1 hour) mouse P815 (H-2D'*) 
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Stimulator cells at a ratio of 10:1 in medium supplemented with 5 x 10'^ M p- 
mercaptoethanoL For assay of CTL response to HIV-1 BX08, P815 stimulator cells and 
target cells were pulsed with 20 ^ig/ml of the HIV-1 BX08 V3 peptide containing a conserved 
murine H-2D^ restricted CTL epitope (IGPGRAFYTT) (Lapham et al., 1996). After 
5 stimulation, splenocytes were washed three times with RPMI-1640 supplemented with 10% 
PCS and resuspended to a final concentration of 5 x 10® cells/ml. 100 jliI of cell suspension 
was added in triplicate to U-bottom 96-well microtiter plates and a standard 4 hour ^^Cr- 
release assay performed (Marker et al., 1973). 

All synthetic BX08 plasmids induced a high specific CTL response thus confirming the in vivo 
10 expression and in vivo immunogenicity. The highest CTL response was obtained with 
syn.gp150Bxo8 followed by syn.gp120Bxo8-syn.gp140Bxo8, and syn.gplSOBxoa, respectively. 
Thus, the CTL response induced did not correlate with the antigen being secreted or not. 
However, i.m. DNA immunization with syn.gplSOsxos containing six putative CpG motifs 
induced a higher CTL response than gene gun immunization (Fig. 26). This difference could 
15 be explained by the high amount of DNA used in the i.m. injections. 

The T-lymphocyte cytokine profile of spleen cells after ConA stimulation as well as serum 
antibody IgGsa/lgGi at week 18 were investigated. Neither the IFNy/lL-4 nor the lgG2a/lgGi 
ratios, which both reflects a Thi-type of immune response, were significantly higher for the 
i.m. immunized mice when compared with gene gun immunized mice (student t-test and 
20 Mann>Withney U-test). Thus, the CTL response did not correlate with a certain Th-type of 
response and the DNA immunization technique did not bias the immune response using 
synthetic BX08 genes. 

Example 11: Antibody responses to DNA vaccination with synBXOS env plasmid 

25 A relatively low and variable antibody response (1 of 10 mice) was obtained with gene gun 
inoculation of the syn.gp140BX08 plasmid vaccine starting at week 9, figure 23, right panel. 
A higher numbers of responders 3/10 with high IgGI antibody responses at an earlier onset 
(week 3-9) was obtained with the syn.gp140BX08 plasmid using i.m. injection, left panel. 
Sera from later time points may show more responders and/or higher titers but are not 

30 assayed. However, these results show the induction of an antibody response to the BX08 V3 
peptide by DNA vaccination using one of the described synthetic BX08 constructs. 
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Claims 

1 . A method for producing a nucleotide sequence construct comprising the following steps: 

a) obtaining a first nucleotide sequence of an HIV gene from a patient within the first 12 
5 months of infection; 

b) designing a second nucleotide sequence utilising the most frequent codons from 
mammalian highly expressed proteins to encode the same amino acid sequence as the 
first nucleotide sequence of a) encodes 

c) redesigning the second nucleotide sequence of b) so that restriction enzyme sites 

10 surrounds the regions of the nucleotide sequence which encode functional regions of the 
amino acid sequence and so that selected restriction enzyme sites are removed thereby 
obtaining a third nucleotide sequence encoding the same amino acid sequence as the 
first and the second nucleotide sequence of a) and b) encode; 

d) redesigning the third nucleotide sequence of c) so that the terminal snuts contain 
15 convenient restriction enzyme sites for cloning into an expression vehicle; 

e) producing the snuts between restriction enzyme sites of c) and terminal snuts of d); 

f) assembling the snut of step e) to form a nucleotide sequence construct. 

2. A method according to claim 1 , wherein the HIV gene is the gene encoding the envelope. 

20 

3. A method according to claim 1 or 2, wherein the HIV gene encodes one or more Gag 
proteins. 

4. A method according to any of the preceding claims, wherein the HIV in step a) is in group 
25 M, O or N 

5. A method according to claim 4, wherein the HIV is a group M virus. 

6. A method according to any of the preceding claims, wherein the HIV is subtype A, B, C, D, 
30 E, F, G, H, I, or J. 

7. A method according to claim 6, wherein the HIV is subtype B. 

8. A method according to any of the preceding claims wherein the first nucleotide sequence 
35 is obtained by direct cloning. 
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9. A method according to any of the preceding claims, wherein the HIV in step a) is isolated 
with the first 1 1 months of infection, such a 1 0, 9, 8, 7, 6, 5, 4, 3, 2, 1 , or 0.5 month after 
infection. 

10. A method according to any of the preceding claims, wherein the redesigning in step c) is 
carried out after the second nucleotide sequence of step b) has been divided into pieces, 
so that each piece comprises only different restriction enzyme sites. 

11. A method according to claim 10, wherein the second nucleotide sequence of step b) is 
divided into 9 pieces, or 8, or 7, or 6, or 5, or 4, or 3, or 2 pieces. 

12. A method according to claim 11, wherein the second nucleotide sequence of step b) is 
divided into 3 pieces. 

13. A method according to any of the preceding claims, wherein the second nucleotide 
sequence of step b) is designed utilising the most frequent codons from human highly 
expressed proteins to encode the same amino acid sequence as the first nucleotide 
sequence of step a) encodes. 

14. A nucleotide sequence construct obtainable by the method of any of claims 1-13. 

15. A nucleotide sequence construct according to claim 14, wherein the nucleotide sequence 
encoding the amino acid sequence in the first variable region is surrounded by EcoRV 
and Pst\ restriction enzyme sites. 

16. A nucleotide sequence construct according to claims 14 or 15, wherein the nucleotide 
sequence encoding the amino acid sequence in the second variable region is surrounded 
by Pst\ and C/al restriction enzyme sites. 

17. A nucleotide sequence construct according to any of claims 14-16, wherein the 
nucleotide sequence encoding the amino acid sequence in the third variable region is 
surrounded by C/al and EcoRI restriction enzyme sites. 
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18. A nucleotide sequence construct according to any of claims 14-17, wherein the 

nucleotide sequence encoding the amino acid sequence in the transmembrane spanning 
region is surrounded by HindlW and Sacll restriction enzyme sites. 

5 19. A nucleotide sequence construct according to any of claims 14-18, wherein the 

nucleotide sequence encoding the amino acid sequence on both sites of the cleavage site 
is surrounded by Pstl and Xba\ restriction enzyme sites. 

20. A nucleotide sequence construct in isolated form which has a nucleotide sequence with 
10 the general formula (I), (II), (III), or (IV) or subsequences thereof 

(I) Pl"S495ClarS650-720EcoRrP2-Si265gp120 

(II) Pl"S495ClarS650-720EcoRrP2-Si265Xhor Si465Pstr P4gp140 

(III) Pl-S495ClarS650-720EcoRrP2-Si265Xhor Si465Pstr P4gp150 

(IV) Pl'"S495ClarS660-720EcoRrP2-Si266Xhor Si465Pstr P4gp160- S2060Sacir Ps 

15 wherein Pi designates the nucleotide sequence SEQ ID NO:41, a nucleotide sequence 

complementary thereto, or a nucleotide sequence with a sequence identity of at least 90% 
thereto; 

wherein 84950131 designates the nucleotide sequence SEQ ID NO: 7, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 
20 95%thereto; 

wherein S65o-72oecori designates the nucleotide sequence SEQ ID NO: 9, a nucleotide 

sequence complementary thereto, or a nucleotide sequence with a sequence identity of at 
least 95% thereto; 

wherein P2 designates the nucleotide sequence SEQ ID NO: 43, a nucleotide sequence 
25 complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

wherein Si265gpi2o designates the nucleotide sequence SEQ ID NO: 19, a nucleotide 

sequence complementary thereto, or a nucleotide sequence with a sequence identity of at 
least 70% thereto; 

30 wherein Si265Xhoi designates the nucleotide sequence SEQ ID NO: 17, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 80% 
thereto; 

wherein Suespsti designates the nucleotide sequence SEQ ID NO: 23, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 90% 
35 thereto; 
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wherein P4gpi4o designates the nucleotide sequence SEQ ID NO: 57, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

wherein P4gpi5o designates the nucleotide sequence SEQ ID NO: 55, a nucleotide sequence 
5 complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

wherein P4gpi6o designates the nucleotide sequence SEQ ID NO: 53, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

10 wherein S206osaci! designates the nucleotide sequence SEQ ID NO: 33, a nucleotide 

sequence complementary thereto, or a nucleotide sequence with a sequence identity of at 
least 98% thereto; and 
wherein P5 designates the nucleotide sequence SEQ ID NO: 59, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 

15 thereto. 

21. A nucleotide sequence construct according to claim 20, with the formula (I) 

( I ) Pi -S495ClarS650-720EcoRr P2-S ^ 265gp1 20 

20 22. A nucleotide sequence construct according to claim 20, with the formula (II) 

(II) Pl-S495ClarS650-720EcoRrP2-Si265Xhor Si465Pstr P4gp140 

23. A nucleotide sequence construct according to claim 20, with the formula (III) 

(III) Pl-S495C!arS650-720EcoRrP2-Si265Xhor Si465Pstr P4gp150 

25 

24. A nucleotide sequence construct according to claim 20, with the formula (IV) 

(IV) Pl-S495ClarS650-720EcoRrP2-Si265Xhor Si465Pstr P4gp160" S2060Sacir Ps 

25. A nucleotide sequence construct according to claim 20 consisting essentially of the 
30 subsequence Pi. 

26. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence S495ciai. 



35 



27. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence S65o-72oecori- 
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28. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence P2. 

5 29. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence Si265gpi2o. 

30. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence Si265Xhoi. 

10 

31. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence Si465Psti- 

32. A nucleotide sequence construct according to claim 20 consisting essentially of the 
15 subsequence P4gpi4o. 

33. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence P4gpi5o. 

20 34. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence P4gpi6o. 

35. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence Saoeosacii 

25 

36. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence P5. 

37. A nucleotide sequence construct with a sequence identity of more than 85% to the 
nucleotide sequence construct in any of claims 20-35. 

38. A nucleotide sequence construct according to claim 37, wherein the sequence identity is 
more than 90% such as more than 95%, 98%, or 99%. 

39. A nucleotide sequence construct according to claim 37, wherein the sequence identity is 
100%. 
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40. A nucleotide sequence construct according to any of claims 14-39, coding for an HIV 
envelope or parts thereof with an improved immunogenicity obtained by mutating the 
nucleotide sequence construct of any of claims 14-39 such that one or more glycosylation 

5 sites in the amino acid sequence have been removed. 

41 . A nucleotide sequence construct according to claim 40 with a mutation at positions 
A307C + C309A and/or A325C + C327G and/or A340C + C342A and/or A385C + C387A 
and/or A469C + C471A or any combination of those. 

10 

42. A nucleotide sequence construct according to any of claims 14-41, coding for an HIV 
envelope or parts thereof with a binding site for the CXCR4 co-receptor in the third 
variable region. 

15 43. A nucleotide sequence construct according to claim 42 with a mutation at positions 
G865C + A866G. 

44. A nucleotide sequence construct according to any of claims 14-43, coding for an HIV 
envelope or parts thereof, wherein an immunodominant epitope has been modified. 

20 

45. A nucleotide sequence construct according to claim 44, wherein an immunodominant 
epitope in the third variable region has been modified. 

46. A nucleotide sequence construct according to claim 45 with a deletion of nucleotides 
25 793-897. 

47. A nucleotide sequence construct according to claim 44, wherein an immunodominant 
epitope has been removed from gp41. 

30 48. A nucleotide sequence construct according to any of claims 14-47, coding for an HIV 
envelope or parts thereof, wherein the cleavage site between gp41 and gp120 is 
removed. 

49. A nucleotide sequence construct according to claim 48 with a mutation at position 
35 C1423A. 
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50. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence Pi, 849501311 S65o-72oecori, and P2. 

51 . A nucleotide sequence construct according to claim 20 consisting essentially of the 
5 subsequence Si265Xhoi, Si465Pstr. and 

52. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence Si265Xhoii Si466Psti. P4gpi6o, S206osacJh and P5. 

10 53. A nucleotide sequence construct according to any of claims 14-52, further comprising a 
nucleotide sequence repeat coding for a functional region of the amino acid sequence. 

54. A nucleotide sequence construct according to claim 53, wherein the nucleotide sequence 
repeat codes for amino acids in the third variable region. 

15 

55. A nucleotide sequence construct according to any of claim 14-54, further comprising a 
nucleotide sequence coding for a T-helper cell epitope containing sequence. 

56. An expression vehicle selected from a group of viral vectors consisting of simliki forest 
20 virus (sfv), adenovirus and Modified Vaccinia Virus Ankara (MVA), further comprising a 

nucleotide sequence construct according to any of claim 14-55. 

57. A method of individualised immunotherapy wherein the virus from a newly diagnosed 
patient is directly cloned, the envelope is produced with highly expressed codons, inserted 

25 into any of the nucleotide sequence constructs of claims 14-55, and administered to the 
patient. 

58. Use of a nucleotide sequence construct according to any of claims 14-55 in medicine. 

30 59. Use of a nucleotide sequence according to any of claims 14-55 for the manufacture of a 
vaccine for the prophylactics of infection with HIV in humans. 

60. Use of a nucleotide sequence according to any of claims 14-55 for the manufacture of a 
composition for the treatment of an HIV infection in a human within 24 weeks of primary 
35 infection. 
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61 . Use of the nucleotide sequence according to any of claims 14-55 for the production of a 
recombinant protein. 
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Fig. 26 A CTL activity induced by 

different synthetic DNA vaccines 
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SEQUENCE LISTING 



<110> Statens Serum Institut 



<120> Method for producing an nucleotide 

sequence construct with optimised codons for an HIV envelope 
based on a primary, clinical HIV isolate and the BX08 
construct . 



<130> 21924DK1 



<160> 76 



<170> FastSEQ for Windows Version 3.0 



<210> 1 
<211> 243 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (243) 



<400> 1 

get age gcg gcc gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 48 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
15 10 15 



gtg tgg aag gac gcc acc acc acc ctg ttc tgc gcc age gac gcc aag 96 
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
20 25 30 



gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 144 
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 40 45 



ccc acc gac ccc aac ccc cag gag gtg 
Pro Thr Asp Pro Asn Pro Gin Glu Val 
50 55 . 

aac ttc aac atg ggc aag aac aac atg 
Asn Phe Asn Met Gly Lys Asn Asn Met 
65 70 

ate 
He 



gtg ctg ggc aac gtg acc gag 192 
Val Leu Gly Asn Val Thr Glu 
60 

gtg gag cag atg cac gag gat 240 
Val Glu Gin Met His Glu Asp 
75 80 

243 



<210> 2 
<211> 81 
<212> PRT 

<213> Artificial Sequence 
<400> 2 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
15 10 15 
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Val 
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Val 






35 
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Glu 


Val 


Val 


Leu 


Gly 
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Val 


Thr 
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50 










55 










60 
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Met 


Gly 


Lys 


Asn 


Asn 


Met 


Val 


Glu 


Gin 


Met 


His 


Glu 


Asp 


65 










70 










75 










80 


He 

































<210> 3 
<211> 143 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (143) 

<400> 3 

gat ate ate age ctg tgg gac cag age etg aag eee tge gtg aag ctg 4 
Asp He He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu 
15 10 15 



ace ccc etg tge gtg acc etg aac tge ace aag ctg aag aac age ace 9 

Thr Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr 

20 25 30 

gac acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tge ag 14 

Asp Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys 

35 40 45 



<210> 4 

<211> 47 
<212> PRT 

<213> Artificial Sequence 
<400> 4 

Asp He He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu 

15 10 15 

Thr Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr 

20 25 30 

Asp Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys 
35 40 45 

<210> 5 
<211> 132 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (132) 

<400> 5 

ctg cag ctt caa cat cag cac cag cgt gcg caa caa gat gaa gcg cga 4 
Leu Gin Leu Gin His Gin His Gin Arg Ala Gin Gin Asp Glu Ala Arg 
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10 15 

96 

132 
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Pro 
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Arg 


Gin 






20 
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30 
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cag eta 
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cac 


ate 


gat 










His 
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Pro 
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Ala 


Gin 
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Gin 


His 


lie 


Asp 












35 










40 




















<210> 


6 
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<212> 


PRT 




























<213> 


Artificial Sequence 




















<400> 


6 


























Leu 


Gin Leu 


Gin 


His 


Gin 


His 


Gin 


Arg 


Ala 


Gin 


Gin 


Asp 


Glu 


Ala 


Arg 


1 






5 










10 










15 




Val 


Arg Pro 


Val 


Leu 


Gin 


Pro 


Gly 


His 


Arg 


Ala 


His 


Arg 


Gin 


Arg 


Gin 






20 










25 










30 






His 


Gin Leu 


Pro 


Pro 


Ala 


Gin 


Leu 


Gin 


His 


lie 


Asp 











35 40 





<210> 


7 




<211> 


161 




<212> 


DNA 




<213> 


Artificial ; 




<220> 






<221> 


CDS 




<222> 


(1) . . . (161) 




<400> 


7 


ate 


gat cat 


cac cca ggc 


He 


Asp His 


His Pro Gly 


1 




5 



g ccc caa ggt gag ett cga gcc cat ecc 4 8 

u Pro Gin Gly Glu Leu Arg Ala His Pro 
10 15 

cat cca ctt ctg cgc ccc cgc egg ett cgc cat cct gaa gtg caa caa 96 
His Pro Leu Leu Arg Pro Arg Arg Leu Arg His Pro Glu Val Gin Gin 
20 .25 30 

caa gac ctt caa egg cac egg ccc ctg cac caa cgt gag cac cgt gea 144 
Gin Asp Leu Gin Arg His Arg Pro Leu His Gin Arg Glu His Arg Ala 
35 40 45 

gtg cac cca egg aat tc 161 
Val His Pro Arg Asn 
50 



<210> 8 
<211> 53 
<212> PRT 

<213> Artificial Sequence 
<400> 8 

He Asp His His Pro Gly Leu Pro Gin Gly Glu Leu Arg Ala His Pro 
15 10 15 
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His Pro Leu Leu Arg Pro Arg Arg Leu Arg His Pro Glu Val Gin Gin 

20 25 30 

Gin Asp Leu Gin Arg His Arg Pro Leu His Gin Arg Glu His Arg Ala 

35 40 45 

Val His Pro Arg Asn 
50 



<210> 9 
<211> 254 
<212> DNA 

<213> Artificial Sequence 



<220> 

<221> CDS 

<222> (1) . . . (254) 



<400> 9 

gaa ttc gcc ccg tgg tga gca ccc age tgc tgc tga acg gca gcc tgg 48 

Glu Phe Ala Pro Trp * Ala Pro Ser Cys Cys * Thr Ala Ala Trp 
15 10 



ccg agg agg agg tgg tga tea gat ctg aga act tea eea aca acg cca 96 
Pro Arg Arg Arg Trp * Ser Asp Leu Arg Thr Ser Pro Thr Thr Pro 
15 20 25 



aga cca tea teg tgc age tga acg aga gcg tgg aga tea act gca ccc 14 4 

Arg Pro Ser Ser Cys Ser * Thr Arg Ala Trp Arg Ser Thr Ala Pro 
30 35 40 



gcc cca aca aca aca ccc gca aga gca tee aca teg gcc ctg gcc gcg 192 
Ala Pro Thr Thr Thr Pro Ala Arg Ala Ser Thr Ser Ala Leu Ala Ala 
45 50 55 60 



cct tct aca cca ccg gcg aca tea teg gcg aca tec gcc agg ccc act 240 
Pro Ser Thr Pro Pro Ala Thr Ser Ser Ala Thr Ser Ala Arg Pro Thr 
65 70 75 



gca aca tct eta ga 
Ala Thr Ser Leu 
80 



254 



<210> 10 
<211> 80 
<212> PRT 

<213> Artificial Sequence 

<400> 10 
Glu Phe Ala Pro Trp Ala Pro Ser 

1 5 
Arg Arg Trp Ser Asp Leu Arg Thr 
20 

Ser Cys Ser Thr Arg Ala Trp Arg 

35 40 
Thr Pro Ala Arg Ala Ser Thr Ser 

50 55 
Pro Ala Thr Ser Ser Ala Thr Ser 
65 70 



Cys Cys Thr Ala Ala Trp Pro Arg 

10 15 
Ser Pro Thr Thr Pro Arg Pro Ser 
25 30 
Ser Thr Ala Pro Ala Pro Thr Thr 
45 

Ala Leu Ala Ala Pro Ser Thr Pro 
60 

Ala Arg Pro Thr Ala Thr Ser Leu 
75 80 
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<210> 11 
<211> 92 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (92) 

<400> 11 

tct aga acc aac tgg acc aac acc ctg aag cgc gtg gcc gag aag ctg 48 
Ser Arg Thr Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu 
5 - 10 15 

cgc gag aag ttc aac aac acc acc ate gtg ttc aac cag age tc 92 
Arg Glu Lys Phe Asn Asn Thr Thr lie Val Phe Asn Gin Ser 
20 25 30 



<210> 12 
<211> 30 
<212> PRT 

<213> Artificial Sequence 
<400> 12 

Ser Arg Thr Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu 

15 10 15 

Arg Glu Lys Phe Asn Asn Thr Thr lie Val Phe Asn Gin Ser 
20 25 30 

<210> 13 
<211> 130 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (130) 

<400> 13 

gag etc egg egg ega ccc cga gat cgt gat gea cag ett caa ctg egg 4 8 

Glu Leu Arg Arg Arg Pro Arg Asp Arg Asp Ala Gin Leu Gin Leu Arg 

15 10 15 

egg cga gtt ett eta ctg caa cae eac eca get gtt caa eag cac ctg 96 
Arg Arg Val Leu Leu Leu Gin His His Pro Ala Val Gin Gin His Leu 
20 25 30 

gaa cga gac caa cag ega ggg caa cat cae tag t 130 
Glu Arg Asp Gin Gin Arg Gly Gin His His * 
35 40 



<210> 14 

<211> 42 
<212> PRT 

<213> Artificial Sequence 



<400> 14 
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Glu 


Leu Arg 


Arg 


Arg Pro Arg 


Asp 


Arg 


Asp 


Ala 


Gin 


Leu 


Gin 


Leu 


Arg 


1 






5 






10 










15 




Arg 


Arg Val 


Leu 


Leu Leu Gin 


His 


His 


Pro 


Ala 


Val 


Gin 


Gin 


His 


Leu 






20 






25 










30 






Glu 


Arg Asp 


Gin 


Gin Arg Gly 


Gin 


His 


His 
















35 






40 




















<210> 


15 
























<211> 


164 
























<212> 


DNA 
























<213> 


Artificial Sequence 




















<220> 


























<221> 


CDS 
























<222> 


(1) . 


. . . (164) 






















<400> 


15 






















act 


agt ggc 


acc 


ate acc ctg 


ccc 


tgc 


cgc 


ate 


aag 


cag 


ate 


ate 


aae 


Thr 


Ser Gly 


Thr 


lie Thr Leu 


Pro 


Cys 


Arg 


He 


Lys 


Gin 


He 


He 


Asn 


1 






5 






10 










15 





48 



atg tgg cag gag gtg ggc aag gee atg tac gee ccc ccc ate ggc ggc 96 
Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro He Gly Gly 
20 25 30 

cag ate aag tgc ctg age aae ate acc ggc ctg ctg ctg acc cgc gac 144 
Gin He Lys Cys Leu Ser Asn He Thr Gly Leu Leu Leu Thr Arg Asp 
35 40 45 

ggc ggc age gac aae teg ag 164 
Gly Gly Ser Asp Asn Ser 
50 



<210> 16 
<211> 54 
<212> PRT 

<213> Artificial Sequence 

<400> 16 
Thr Ser Gly Thr He Thr Leu Pro 

1 5 
Met Trp Gin Glu Val Gly Lys Ala 

20 

Gin He Lys Cys Leu Ser Asn He 

35 40 
Gly Gly Ser Asp Asn Ser 
50 

<210> 17 
<211> 200 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> {!)... (200) 



Cys Arg He Lys Gin He He Asn 

10 15 
Met Tyr Ala Pro Pro He Gly Gly 
25 30 
Thr Gly Leu Leu Leu Thr Arg Asp 
45 



<400> 17 



wo 00/29561 



PCT/DKOO/00144 



7 



etc gag cag egg caa gga gat ttt ccg cce egg egg egg ega eat gcg 4 8 

Leu Glu Gin Arg Gin Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 

15 10 15 

cga caa ctg gcg cag cga get gta caa gta caa ggt ggt gaa gat ega 96 

Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 

20 25 30 

gee cet ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 14 4 

Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 

35 40 45 

cg^ gaa geg cgc egt ggg eat egg egc^-tat gtt ect egg ctt cet ggg 192 

Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 

50 55 60 

cgc tgc ag 200 
Arg Cys 
65 





<210> 


18 






















<211> 


66 






















<212> 


PRT 






















<213> 


Artificial Sequence 


















<400> 


18 




















Leu 


Glu Gin 


Arg 


Gin 


Gly Asp Phe 


Pro 


Pro 


Arg 


Arg 


Arg 


Arg 


His Ala 


1 






5 






10 










15 


Arg 


Gin Leu 


Ala 


Gin 


Arg Ala Val 


Gin 


Val 


Gin 


Gly 


Gly 


Glu 


Asp Arg 






20 






25 










30 




Ala 


Pro Gly 


His 


Arg 


Pro His Gin 


Gly 


Gin 


Ala 


Pro 


Arg 


Gly 


Ala Ala 




35 






40 










45 






Arg 


Glu Ala 


Arg 


Arg 


Gly His Arg 


Arg 


Tyr 


Val 


Pro 


Arg 


Leu 


Pro Gly 



50 55 60 

Arg Cys 
65 



<210> 19 
<211> 212 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (212) 

<400> 19 

etc gag cag egg caa gga gat ttt ccg cce egg egg egg ega eat geg 48 

Leu Glu Gin Arg Gin Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 

15 10 15 

cga caa ctg gcg cag cga get gta caa gta caa ggt ggt gaa gat ega 96 
Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 
20 25 30 



gee cet ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 14 4 

Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 
35 40 45 



wo 00/29561 



8 



PCT/DKOO/00144 



cga gaa gcg cgc eta ggg cat egg cgc tat gtt cct egg ett cet ggg 192 

Arg Glu Ala Arg Leu Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 

50 55 60 

cgc tgc age ecg ggg gat cc 212 

Arg Cys Ser Pro Gly Asp 

65 70 



<210> 20 
<211> 70 
<212> PRT 

<213> Artificial Sequence 
<400> 20 

Leu Glu Gin Arg Gin Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 

1 5 10 15 

Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 

20 25 30 

Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 

35 40 45 

Arg Glu Ala Arg Leu Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 

50 55 60 

Arg Cys Ser Pro Gly Asp 
65 70 

<210> 21 
<211> 200 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (200) 

<400> 21 

etc gag cag egg caa gga gat ttt ecg ccc egg egg egg cga eat gcg 4 8 

Leu Glu Gin Arg Gin Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 
15 10 15 

cga caa ctg gcg cag cga get gta caa gta caa ggt ggt gaa gat cga 96 
Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 
20 25 30 

gcc cct ggg cat cgc ccc cac caa ggc caa gcg ecg egt ggt gea gcg 14 4 

Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 
35 40 45 

cga gaa gag cgc egt ggg cat egg cgc tat gtt eet egg ett ect ggg 192 
Arg Glu Glu Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 
50 55 60 

cgc tgc ag 200 
Arg Cys 
65 



<210> 22 



wo 00/29561 
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9 





<211> 


66 
























<212> 


PRT 
























<213> 


Artificial Sequence 




















<400> 


22 






















Leu 


Glu Gin 


Arg 


Gin 


Gly Asp Phe 


Pro 


Pro 


Arg 


Arg 


Arg 


Arg 


His 


Ala 


1 






5 






10 










15 




Arg 


Gin Leu 


Ala 


Gin 


Arg Ala Val 


Gin 


Val 


Gin 


Gly 


Gly 


Glu 


Asp 


Arg 






20 






25 










30 






Ala 


Pro Gly 


His 


Arg 


Pro His Gin 


Gly 


Gin 


Ala 


Pro 


Arg 


Gly 


Ala 


Ala 




35 






40 










45 








Arq 


Glu Glu 


Arg 


Arg 


Gly His Arg 


Arg 


Tyr 


Val 


Pro 


Arg 


Leu 


Pro 


Gly 



50 55 60 

Arg Cys 
65 



<210> 23 
<211> 178 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (178) 

<400> 23 

ctg cag gca gca cca tgg gcg ccg cca gcc tga ccc tga ccg tgc agg 48 

Leu Gin Ala Ala Pro Trp Ala Pro Pro Ala * Pro * Pro Cys Arg 
15 10 



ccc gcc age tgc tga gcg gca teg tgc age age aga aca acc tgc tgc 96 
Pro Ala Ser Cys * Ala Ala Ser Cys Ser Ser Arg Thr Thr Cys Cys 
15 20 25 



gcg cca teg agg ccc age age acc tgc tec age tga ccg tgt ggg gca 14 4 

Ala Pro Ser Arg Pro Ser Ser Thr Cys Ser Ser * Pro Cys Gly Ala 
30 35 40 



tea age age tec agg ccc gcg tgc tgg etc tag a 178 
Ser Ser Ser Ser Arg Pro Ala Cys Trp Leu * 
45 50 





<210> 


24 




















<211> 


54 




















<212> 


PRT 




















<213> 


Artificial Sequence 




















<400> 


24 


















Leu 


Gin Ala 


Ala Pro Trp Ala Pro 


Pro 


Ala 


Pro 


Pro 


Cys 


Arg 


Pro 


Ala 


1 




5 




10 










15 




Ser 


Cys Ala 


Ala Ser Cys Ser Ser 


Arg 


Thr 


Thr 


Cys 


Cys 


Ala 


Pro 


Ser 






20 


25 










30 






Arg 


Pro Ser 


Ser Thr Cys Ser Ser 


Pro 


Cys 


Gly 


Ala 


Ser 


Ser 


Ser 


Ser 




35 


40 










45 








Arg 


Pro Ala 


Cys Trp Leu 



















50 



<210> 25 



wo 00/29561 
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<211> 178 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . _ (178) 

<400> 25 

ctg cag gca gca cca tgg gcg ccg cca gcc tga ccc tga ccg tgc agg 4 8 

Leu Gin Ala Ala Pro Trp Ala Pro Pro Ala * Pro * Pro Cvs Ara 
1 5 10 

ccc gcc age tgc tga gcg gca teg tgc age age aga aca ace tgc tgc 96 
Pro Ala Ser Cys * Ala Ala Ser Cys Ser Ser Arg Thr Thr Cys Cys 
15 20 25 

gcg cca teg agg ccc age age ace tgc tec age tga ccg tgt ggg gca 14 4 

Ala Pro Ser Arg Pro Ser Ser Thr Cys Ser Ser * Pro Cys Gly Ala 
30 35 40 



tea age agt get gcg gcc gcg tgc tgg etc tag a 178 
Ser Ser Ser Ala Ala Ala Ala Cys Trp Leu * 
45 50 



<210> 26 
<211> 54 
<212> PRT 

<213> Artificial Sequence 

<400> 26 
Leu Gin Ala Ala Pro Trp Ala Pro 

1 5 
Ser Cys Ala Ala Ser Cys Ser Ser 
20 

Arg Pro Ser Ser Thr Cys Ser Ser 

35 40 
Ala Ala Ala Cys Trp Leu 
50 



Pro Ala Pro Pro Cys Arg Pro Ala 

10 15 
Arg Thr Thr Cys Cys Ala Pro Ser 
25 30 
Pro Cys Gly Ala Ser Ser Ser Ala 
45 



<210> 27 
<211> 77 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (77) 



<400> 27 

tct aga gcg eta cet cca gga cca gcg ett cet ggg cat gtg ggg ctg 4 8 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 

15 10 15 

etc egg caa get gat ctg cae cac gge eg 77 
Leu Arg Gin Ala Asp Leu His His Gly 
20 25 
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<210> 28 
<211> 25 
<212> PRT 

<213> Artificial Sequence 



<400> 28 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 

15 10 15 

Leu Arg Gin Ala Asp Leu His His Gly 
20 25 



<210> 29 
<211> 190 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<221> CDS 

<222> (1) . . . (190) 



<400> 29 

egg ccg tgc cct gga acg cca get gga gca aca aga acc tga gcc aga 
Arg Pro Cys Pro Gly Thr Pro Ala Gly Ala Thr Arg Thr * Ala Arg 
1 5 10 15 



ttt ggg aca aca tga cct gga tgg agt ggg age gcg aga tea gca act 
Phe Gly Thr Thr * Pro Gly Trp Ser Gly Ser Ala Arg Ser Ala Thr 

20 25 30 



aca ccg aga tea tct aca gcc tga teg agg aga gcc aga acc age agg 
Thr Pro Arg Ser Ser Thr Ala * Ser Arg Arg Ala Arg Thr Ser Arg 
35 40 45 



aga aga acg age tgg acc tgc tec age tgg aca agt ggg eaa get t 190 
Arg Arg Thr Ser Trp Thr Cys Ser Ser Trp Thr Ser Gly Gin Ala 
50 55 60 



<210> 30 

<211> 60 

<212> PRT 

<213> Artificial 



Sequence 



<400> 30 
Arg Pro Cys Pro 
1 

Gly Thr Thr Pro 
20 

Arg Ser Ser Thr 
35 

Ser Trp Thr Cys 
50 



Gly Thr Pro Ala 
5 

Gly Trp Ser Gly 

Ala Ser Arg Arg 
40 

Ser Ser Trp Thr 
55 



Gly Ala Thr Arg 
10 

Ser Ala Arg Ser 
25 

Ala Arg Thr Ser 

Ser Gly Gin Ala 
60 



Thr Ala Arg Phe 
15 

Ala Thr Thr Pro 
30 

Arg Arg Arg Thr 
45 



<210> 31 
<211> 177 
<212> DNA 

<213> Artificial Sequence 
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<220> 






<221> 


CDS 




<222> 


(1) 




<400> 


31 




ctt 


gtg 


gaa 


Lys 


Leu 


Val 


Glu 


1 








gat 


ttt 


cat 


cat 


Asp 


Phe 


His 


His 








20 


cac 


cgt 


get 


gag 


His 


Arg 


Ala 


Glu 






35 





. . (177) 

ctg gtt caa cat cac caa ctg get gtg gta cat caa 48 

Leu Val Gin His His Gin Leu Ala Val Val His Gin 
5 10 15 

gat cgt ggg egg cct gat egg cct gcg cat cgt gtt 96 
Asp Arg Gly Arg Pro Asp Arg Pro Ala His Arg Val 
25 30 

cat cgt gaa ccg cgt gcg cca ggg eta cag ccc cct 14 4 

His Arg Glu Pro Arg Ala Pro Gly Leu Gin Pro Pro 
40 45 

gag ctt cca gac ccg cct gcc cgt gcc ccg egg 177 
Glu Leu Pro Asp Pro Pro Ala Arg Ala Pro Arg 
50 55 



<210> 32 
<211> 59 
<212> PRT 

<213> Artificial Sequence 

<400> 32 

Lys Leu Val Glu Leu Val Gin His His Gin Leu Ala Val Val His Gin 

15 10 15 

Asp Phe His His Asp Arg Gly Arg Pro Asp Arg Pro Ala His Arg Val 

20 25 30 

His Arg Ala Glu His Arg Glu Pro Arg Ala Pro Gly Leu Gin Pro Pro 

35 40 45 

Glu Leu Pro Asp Pro Pro Ala Arg Ala Pro Arg 
50 55 

<210> 33 
<211> 140 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (140) 

<400> 33 

ccg egg ccc cga ccg ccc ega ggg cat ega gga gga ggg egg cga gcg 4 8 

Pro Arg Pro Arg Pro Pro Arg Gly His Arg Gly Gly Gly Arg Arg Ala 
15 10 15 

cga ccg cga ccg cag cac ccg cct ggt gac egg ctt cct gcc cct gat 96 
Arg Pro Arg Pro Gin His Pro Pro Gly Asp Arg Leu Pro Ala Pro Asp 

20 25 30 

ctg gga ega cct gcg cag cct gtt cct gtt cag eta cca teg at 14 0 

Leu Gly Arg Pro Ala Gin Pro Val Pro Val Gin Leu Pro Ser 
35 40 45 
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<210> 


34 


















<211> 


46 


















<212> 


PRT 


















<213> 


Artificial Sequence 


















<400> 


34 
















Pro 


Arg Pro 


Arg Pro Pro Arg Gly 


His 


Arg 


Gly 


Gly 


Gly Arg 


Arg 


Ala 


1 




5 




10 








15 




Arg 


Pro Arg 


Pro Gin His Pro Pro 


Gly 


Asp 


Arg 


Leu 


Pro Ala 


Pro 


Asp 






20 


25 








30 






Leu 


Gly Arg 


Pro Ala Gin Pro Val 


Pro 


Val 


Gin 


Leu 


Pro Ser 








35 


40 










45 







<210> 35 

<211> 129 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (129) 

<400> 35 

ate gat tgc gcg acc tgc tgc tga teg tgg ccc gca teg tgg age tge 4 8 

lie Asp Cys Ala Thr Cys Cys * Ser Trp Pro Ala Ser Trp Ser Cys 
15 10 15 

tgg gee ggc gcg get ggg aga tec tga agt act ggt gga acc tgc tee 96 
Trp Ala Gly Ala Ala Gly Arg Ser * Ser Thr Gly Gly Thr Cys Ser 
20 25 30 

agt act gga gcc agg age tga aga act ctg cag 129 
Ser Thr Gly Ala Arg Ser * Arg Thr Leu Gin 
35 40 





<210> 


36 




















<211> 


40 




















<212> 


PRT 




















<213> 


Artificial Sequence 




















<400> 


36 


















He 


Asp Cys 


Ala Thr Cys Cys Ser 


Trp 


Pro 


Ala 


Ser 


Trp 


Ser 


Cys 


Trp 


1 




5 




10 










15 




Ala 


Gly Ala 


Ala Gly Arg Ser Ser 


Thr 


Gly 


Gly 


Thr 


Cys 


Ser 


Ser 


Thr 






20 


25 










30 






Gly 


Ala Arg 


Ser Arg Thr Leu Gin 



















35 40 



<210> 37 
<211> 114 

<212> DNA 

<213> Artificial Sequence 



<220> 

<221> CDS 

<222> (1) • • , (114) 
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<400> 37 

ctg cag tga gcc tgc tga acg cca ccg cca teg ccg tgg ccg agg gca 4 8 

Leu Gin * Ala Cys * Thr Pro Pro Pro Ser Pro Trp Pro Arg Ala 
1 5 10 



ccg acc gcg tga teg agg tgg tgc age gca tet gge geg gca tec tgc 96 
Pro Thr Ala * Ser Arg Trp Cys Ser Ala Ser Gly Ala Ala Ser Cys 
15 20 25 

aca tec cca ccc gaa ttc 114 
Thr Ser Pro Pro Glu Phe 
30 35 





<210> 


38 


















<211> 


35 


















<212> 


PRT 


















<213> 


Artificial Sequence 


















<400> 


38 
















Leu 


Gin Ala 


Cys Thr Pro Pro Pro 


Ser Pro 


Trp 


Pro 


Arg 


Ala 


Pro 


Thr 


1 




5 


10 










15 




Ala 


Ser Arg 


Trp Cys Ser Ala Ser 


Gly Ala 


Ala 


Ser 


Cys 


Thr 


Ser 


Pro 






20 


25 








30 






Pro 


Glu Phe 



















35 



<210> 39 
<211> 41 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (41) 

<400> 39 

gaa ttc gcc agg get teg age geg ccc tgc tgt aag gat cc 41 
Glu Phe Ala Arg Ala Ser Ser Ala Pro Cys Cys Lys Asp 
15 10 



<210> 40 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<400> 40 

Glu Phe Ala Arg Ala Ser Ser Ala Pro Cys Cys Lys Asp 
15 10 

<210> 41 

<211> 506 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (506) 
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<400> 41 

get age gcg gcc gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 4 8 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 

15 10 15 

gtg tgg aag gac gcc acc acc acc ctg ttc tgc gcc age gac gcc aag 96 
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
20 25 30 

gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 144 
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 40 45 

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192 
Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 
50 55 60 

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240 
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 

ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg ace 288 
lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
85 90 95 

ccc ctg tgc gtg ace ctg aac tgc acc aag ctg aag aac age acc gac 336 
Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 
100 105 110 

ace aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc age ttc 384 
Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 120 125 

aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gee ctg 432 
Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 

130 135 140 

ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tac 480 
Phe Tyr Ser Leu Asp lie Val Pro lie Asp Asn Asp Asn Thr Ser Tyr 
145 150 155 160 

cgc ctg cgc age tgc aac aca teg at 506 
Arg Leu Arg Ser Cys Asn Thr Ser 
165 



<210> 42 
<211> 168 
<212> PRT 

<213> Artificial Sequence 

<400> 42 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 

15 10 15 

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 

20 25 30 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 40 45 
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Pro 


Thr 
50 


Asp 


Pro 


Asn 


Pro 


Gin 
55 


Glu 


Val 


Val 


Leu 


Gly 
60 


Asn 


Val 


Thr 


Glu 


Asn 


Phe 


Asn 


Met 


Gly 


Lys 


Asn 


Asn 


Met 


Val 


Glu 


Gin 


Met 


His 


Glu 


Asp 


65 










70 










75 










80 


lie 


lie 


Ser 


Leu 


Trp 

85 


Asp 


Gin 


Ser 


Leu 


Lys 
90 


Pro 


Cys 


Val 


Lys 


Leu 
95 


Thr 


Pro 


Leu 


Cys 


Val 


Thr 


Leu 


Asn 


Cys 


Thr 


Lys 


Leu 


Lys 


Asn 


Ser 


Thr 


Asp 








100 










105 










110 




Thr 


Asn 


Asn 


Thr 


Arg 


Trp 


Gly 


Thr 


Gin 


Glu 


Met 


Lys 


Asn 


Cys 


Ser 


Phe 






115 










120 










125 






Asn 


He 


Ser 


Thr 


Ser 


Val 


Arg 


Asn 


Lys 


Met 


Lys 


Arg 


Glu 


Tyr 


Ala 


Leu 




130 










135 










140 








Phe 


Tyr 


Ser 


Leu 


Asp 


He 


Val 


Pro 


Ile^^sp Asn 


Asp 


Asn 


Thr 


Ser 


Tyr 


145 










150 










155 










160 


Arg 


Leu 


Arg 


Ser 


Cys 
165 


Asn 


Thr 


Ser 



















<210> 43 
<211> 374 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) • . . (374) 

<400> 43 

tct aga acc aac tgg acc aac acc ctg aag cgc gtg gcc gag aag ctg 48 

Ser Arg Thr Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu 
15 10 15 

cgc gag aag ttc aac aac acc acc ate gtg ttc aac cag age tec ggc 96 
Arg Glu Lys Phe Asn Asn Thr Thr He Val Phe Asn Gin Ser Ser Gly 
20 25 30 

ggc gac ccc gag ate gtg atg cac age ttc aac tgc ggc ggc gag ttc 144 
Gly Asp Pro Glu He Val Met His Ser Phe Asn Cys Gly Gly Glu Phe 
35 40 45 

ttc tac tgc aac acc acc cag ctg ttc aac age acc tgg aac gag acc 192 
Phe Tyr Cys Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr 
50 55 60 

aac age gag ggc aac ate act agt ggc acc ate acc ctg ccc tgc cgc 240 
Asn Ser Glu Gly Asn He Thr Ser Gly Thr He Thr Leu Pro Cys Arg 
65 70 75 80 

ate aag cag ate ate aac atg tgg cag gag gtg ggc aag gcc atg tac 288 
He Lys Gin He He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr 
85 90 95 

gcc ccc ccc ate ggc ggc cag ate aag tgc ctg age aac ate acc ggc 336 
Ala Pro Pro He Gly Gly Gin He Lys Cys Leu Ser Asn He Thr Gly 
100 105 110 



ctg ctg ctg acc cgc gac ggc ggc age gac aac teg ag 
Leu Leu Leu Thr Arg Asp Gly Gly Ser Asp Asn Ser 
115 120 



374 
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<210> 44 
<211> 124 
<212> PRT 

<213> Artificial Sequence 
<400> 44 



Ser 


Arg 


Thr 


Asn 


Trp 


Thr 


Asn 


Thr 


Leu 


Lys 


Arg 


Val 


Ala 


Glu 


Lys 


Leu 


1 








5 










10 










15 




Arg 


Glu 


Lys 


Phe 


Asn 


Asn 


Thr 


Thr 


He 


Val 


Phe 


Asn 


Gin 


Ser 


Ser 


Gly 








20 










25 










30 






Glv 


Asp 


Pro 


Glu 


He 


Val 


Met 


His 


Ser 


Phe 


Asn 


Cys 


Gly 


Gly 


Glu 


Phe 






35 










40 










45 








Phe 


Tyr 


Cys 


Asn 


Thr 


Thr 


Gin 


Leu 


Phe 


Asn 


Ser 


Thr 


Trp 


Asn 


Glu 


Thr 




50 










55 










60 










Asn 


Ser 


Glu 


Gly 


Asn 


He 


Thr 


Ser 


Gly 


Thr 


He 


Thr 


Leu 


Pro 


Cys 


Arg 


65 










70 










75 










80 


lie 


Lys 


Gin 


He 


He 


Asn 


Met 


Trp 


Gin 


Glu 


Val 


Gly 


Lys 


Ala 


Met 


Tyr 










85 










90 










95 




Ala 


Pro 


Pro 


He 


Gly 


Gly 


Gin 


He 


Lys 


Cys 


Leu 


Ser 


Asn 


He 


Thr 


Gly 








100 










105 










110 






Leu 


Leu 


Leu 


Thr 


Arg 


Asp 


Gly 


Gly 


Ser 


Asp 


Asn 


Ser 











115 120 



<210> 45 
<211> 1277 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (1277) 

<400> 45 
get age gcg gcc gac cgc 
Ala Ser Ala Ala Asp Arg 
1 5 

gtg tgg aag gac gcc acc 
Val Trp Lys Asp Ala Thr 
20 

gcc tac gac acc gag gtg 
Ala Tyr Asp Thr Glu Val 
35 

ccc acc gac ccc aac ccc 
Pro Thr Asp Pro Asn Pro 
50 

aac ttc aac atg ggc aag 
Asn Phe Asn Met Gly Lys 
65 70 

ate ate age ctg tgg gac 
He He Ser Leu Trp Asp 
85 



ctg tgg gtg 
Leu Trp Val 



acc ace ctg 
Thr Thr Leu 
25 

cac aac gtg 
His Asn Val 
40 

eag gag gtg 
Gin Glu Val 
55 

aac aac atg 
Asn Asn Met 



eag age ctg 
Gin Ser Leu 



acc gtg tac 
Thr Val Tyr 

10 

ttc tgc gcc 
Phe Cys Ala 



tgg gcc ace 
Trp Ala Thr 



gtg ctg ggc 
Val Leu Gly 
60 

gtg gag cag 
Val Glu Gin 
75 

aag ccc tgc 
Lys Pro Cys 
90 



tac ggc gtg 
Tyr Gly Val 
15 

age gac gee 
Ser Asp Ala 
30 

cac gcg tgc 
His Ala Cys 
45 

aac gtg acc 
Asn Val Thr 



atg cac gag 
Met His Glu 



gtg aag ctg 
Val Lys Leu 
95 



ccc 4 8 

Pro 



aag 96 
Lys 



gtg 144 
Val 



gag 192 
Glu 



gat 240 
Asp 
80 

acc 288 
Thr 



ccc ctg tgc gtg acc ctg aac tgc ace aag ctg aag aac age acc gac 



336 
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Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 
100 105 110 

acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc age ttc 384 
Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 120 125 

aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gee ctg 432 
Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 
130 135 140 

ttc tac age ctg gac ate gtg ecc ate gae aac gac aac acc age tac 480 
■Ptte Tyr Ser Leu Asp lie Val Pro Il-e-Asp Asn Asp Asn Thr Ser Tyr 
145 150 155 160 

cgc ctg cgc age tgc aac aca teg ate ate acc cag gcc tgc ecc aag 528 
Arg Leu Arg Ser Cys Asn Thr Ser lie lie Thr Gin Ala Cys Pro Lys 
165 170 175 

gtg age ttc gag ecc ate ecc ate cac ttc tgc gee ecc gcc ggc ttc 576 
Val Ser Phe Glu Pro lie Pro lie His Phe Cys Ala Pro Ala Gly Phe 
180 185 190 

gcc ate ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ecc tgc 624 
Ala lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 
195 200 205 

acc aac gtg age acc gtg cag tgc acc cac gga att cgc ecc gtg gtg 672 
Thr Asn Val Ser Thr Val Gin Cys Thr His Gly lie Arg Pro Val Val 
210 215 220 

age acc cag ctg ctg ctg aac ggc age ctg gcc gag gag gag gtg gtg 720 
Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
225 230 235 240 

ate aga tct gag aac ttc acc aac aac gcc aag acc ate ate gtg cag 768 
lie Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr lie lie Val Gin 
245 250 255 

ctg aac gag age gtg gag ate aac tgc acc cgc ecc aac aac aac acc 816 
Leu Asn Glu Ser Val Glu lie Asn Cys Thr Arg Pro Asn Asn Asn Thr 
260 265 • 270 

cgc aag age ate cac ate ggc cct ggc cgc gcc ttc tac acc acc ggc 8 64 

Arg Lys Ser lie His lie Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 
275 280 285 

gac ate ate ggc gac ate cgc cag gcc cac tgc aac ate tct aga ace 912 
Asp lie lie Gly Asp lie Arg Gin Ala His Cys Asn lie Ser Arg Thr 
290 295 300 

aac tgg acc aac ace ctg aag cgc gtg gcc gag aag ctg cgc gag aag 960 
Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys 
305 310 315 320 

ttc aac aac acc acc ate gtg ttc aac cag age tec ggc ggc gac ecc 1008 
Phe Asn Asn Thr Thr lie Val Phe Asn Gin Ser Ser Gly Gly Asp Pro 
325 330 335 
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gag ate gtg atg cac age ttc aac tgc ggc ggc gag ttc ttc tac tgc 
Glu lie Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 
340 345 350 



1056 



aac acc acc cag ctg ttc aac age acc tgg aac gag acc aac age gag 
Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu 
355 360 365 



1104 



ggc aac ate act agt ggc ace ate ace ctg cec tgc cge ate aag cag 
Gly Asn lie Thr Ser Gly Thr lie Thr Leu Pro Cys Arg lie Lys Gin 
370 375 380 



1152 



artr: ate aac atg tgg cag gag gtg ggcraag gcc atg tac gee cec cec 
lie lie Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
385 390 395 400 



1200 



ate ggc ggc cag ate aag tgc ctg age aac ate ace ggc ctg ctg ctg 
lie Gly Gly Gin lie Lys Cys Leu Ser Asn lie Thr Gly Leu Leu Leu 
405 410 415 



1248 



acc cgc gac ggc ggc age gac aac teg ag 
Thr Arg Asp Gly Gly Ser Asp Asn Ser 
420 425 



1277 



<210> 46 
<211> 425 
<212> PRT 

<213> Artificial Sequence 



<400> 46 



Ala 


Ser 


Ala 


Ala 


Asp 


Arg 


Leu 


Trp 


Val 


Thr 


Val 


Tyr 


Tyr 


Gly 


Val 


Pro 


1 








5 










10 










15 




Val 


Trp 


Lys 


Asp 


Ala 


Thr 


Thr 


Thr 


Leu 


Phe 


Cys 


Ala 


Ser 


Asp 


Ala 


Lys 








20 










25 










30 






Ala 


Tyr 


Asp 


Thr 


Glu 


Val 


His 


Asn 


Val 


Trp 


Ala 


Thr 


His 


Ala 


Cys 


Val 






35 










40 










45 








Pro 


Thr 


Asp 


Pro 


Asn 


Pro 


Gin 


Glu 


Val 


Val 


Leu 


Gly 


Asn 


Val 


Thr 


Glu 




50 








55 










60 










Asn 


Phe 


Asn 


Met 


Gly 


Lys 


Asn 


Asn 


Met 


Val 


Glu 


Gin 


Met 


His 


Glu 


Asp 


65 










70 










75 










80 


He 


He 


Ser 


Leu 


Trp 


Asp 


Gin 


Ser 


Leu 


Lys 


Pro 


Cys 


Val 


Lys 


Leu 


Thr 










85 










90 










95 




Pro 


Leu 


Cys 


Val 


Thr 


Leu 


Asn 


Cys 


Thr 


Lys 


Leu 


Lys 


Asn 


Ser 


Thr 


Asp 








100 










105 










110 






Thr 


Asn 


Asn 


Thr 


Arg 


Trp 


Gly 


Thr 


Gin 


Glu 


Met 


Lys 


Asn 


Cys 


Ser 


Phe 






115 










120 










125 








Asn 


He 


Ser 


Thr 


Ser 


Val 


Arg 


Asn 


Lys 


Met 


Lys 


Arg 


Glu 


Tyr 


Ala 


Leu 




130 










135 










140 










Phe 


Tyr 


Ser 


Leu 


Asp 


He 


Val 


Pro 


He 


Asp 


Asn 


Asp 


Asn 


Thr 


Ser 


Tyr 


145 










150 










155 










160 


Arg 


Leu 


Arg 


Ser 


Cys 


Asn 


Thr 


Ser 


He 


He 


Thr 


Gin 


Ala 


Cys 


Pro 


Lys 










165 










170 










175 




Val 


Ser 


Phe 


Glu 


Pro 


He 


Pro 


He 


His 


Phe 


Cys 


Ala 


Pro 


Ala 


Gly 


Phe 








180 










185 










190 






Ala 


He 


Leu 


Lys 


Cys 


Asn 


Asn 


Lys 


Thr 


Phe 


Asn 


Gly 


Thr 


Gly 


Pro 


Cys 






195 










200 










205 








Thr 


Asn 


Val 


Ser 


Thr 


Val 


Gin 


Cys 


Thr 


His 


Gly 


He 


Arg 


Pro 


Val 


Val 



210 215 220 



wo 00/29561 
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20 



Ser 


Thr 


Gin 


Leu 


Leu 


Leu 


Asn 


Gly 


Ser 


Leu 


Ala 


Glu 


Glu 


Glu 


Val 


Val 


225 










230 










235 










240 


lie 


Arg 


Ser 


Glu 


Asn 


Phe 


Thr 


Asn 


Asn 


Ala 


Lvs 


Thr 


He 


He 


Val 


Gin 










245 










250 










255 




Leu 


Asn 


Glu 


Ser 


Val 


Glu 


He 


Asn 


Cys 


Thr 


Arg 


Pro 


Asn 


Asn 


Asn 


Thr 








260 










265 










270 






Arg 


Lvs 


Ser 


He 


His 


He 


Gly 


Pro 


Gly 


Arg 


Ala 


Phe 


Tyr 


Thr 


Thr 


Gly 






275 










280 










285 








Asp 


lie 


He 


Glv 

v_» J. _y 


Asp 


He 


Arg 


Gin 


Ala 


His 


Cvs 


Asn 


He 


Ser 


Arg 


Thr 




290 










295 










300 










Asn 


Trp 

IT 


Thr 


Asn 


Thr 


Leu 


Lys 


Arq 


Val 


Ala 


Glu 


Lys 


Leu 


Arg 


Glu 


Lys 


305 










310 










315 










320 




Asn 


Asn 


Thr 


Thr 


He 


Val 


Phe 


Asn-Gln 


Ser 


Ser 


Gly 


Gly 


Asp 


Pro 










325 










330 










335 




Glu 


lie 


Val 


Met 


His 


Ser 


Phe 


Asn 


Cys 


Gly 


Glv 


Glu 


Phe 


Phe 


Tyr 


Cys 

J: 








340 










345 










350 






Asn 


Thr 


Thr 


Gin 


Leu 


Phe 


Asn 


Ser 


Thr 


Trp 


Asn 


Glu 


Thr 


Asn 


Ser 


Glu 






355 










360 










365 








Gly 


Asn 


He 


Thr 


Ser 


Gly 


Thr 


He 


Thr 


Leu 


Pro 


Cys 


Arg 


He 


Lys 


Gin 




370 










375 










380 










lie 


He 


Asn 


Met 


Trp 

4. J- ^ 


Gin 


Glu 


Val 


Gly 


Lys 


Ala 


Met 


Tyr 


Ala 


Pro 


Pro 


385 










390 










395 










400 


lie 


Gly 


Gly 


Gin 


He 


Lys 


Cys 


Leu 


Ser 


Asn 


He 


Thr 


Gly 


Leu 


Leu 


Leu 










405 










410 










415 




Thr 


Arg 


Asp 


Gly 


Gly 


Ser 


Asp 


Asn 


Ser 






















420 










425 


















<210> 


47 




























<211> 


1277 


























<212> 


DNA 




























<213> 


Artificial Sequence 




















<220> 






























<221> 


CDS 




























<222> 


(1) « 


. . . (1277) 
























<^ 


100> 


47 


























get 


age 


gcg 


gcc 


gae 


cgc 


etg 


tgg 


gtg 


acc 


gtg 


tac 


tac 


ggc 


gtg 


ccc 


Ala 


Ser 


Ala 


Ala 


Asp 


Arg 


Leu 


Trp 


Val 


Thr 


Val 


Tyr 


Tyr 


Gly 


Val 


Pro 


1 








5 










10 










15 




gtg 


tgg 


aag 


gac 


gcc 


acc 


acc 


acc 


etg 


ttc 


tgc 


gcc 


age 


gac 


gcc 


aag 


Val 


Trp 


Lys 


Asp 


Ala 


Thr 


Thr 


Thr 


Leu 


Phe 


Cys 


Ala 


Ser 


Asp 


Ala 


Lys 








20 










25 










30 






gcc 


tac 


gac 


acc 


gag 


gtg 


cac 


aac 


gtg 


tgg 


gee 


acc 


cae 


gcg 


tgc 


gtg 


Ala 


Tyr 


Asp 


Thr 


Glu 


Val 


His 


Asn 


Val 


Trp 


Ala 


Thr 


His 


Ala 


Cys 


Val 






35 










40 










45 








ccc 


acc 


gac 


ccc 


aac 


ccc 


cag 


gag 


gtg 


gtg 


etg 


ggc 


aac 


gtg 


acc 


gag 


Pro 


Thr 


Asp 


Pro 


Asn 


Pro 


Gin 


Glu 


Val 


Val 


Leu 


Gly 


Asn 


Val 


Thr 


Glu 




50 










55 










60 










aac 


ttc 


aac 


atg 


ggc 


aag 


aac 


aac 


atg 


gtg 


gag 


cag 


atg 


cac 


gag 


gat 


Asn 


Phe 


Asn 


Met 


Gly 


Lys 


Asn 


Asn 


Met 


Val 


Glu 


Gin 


Met 


His 


Glu 


Asp 


65 










70 










75 










80 


ate 


ate 


age 


etg 


tgg 


gae 


eag 


age 


etg 


aag 


eee 


tgc 


gtg 


aag 


etg 


ace 


lie 


He 


Ser 


Leu 


Trp 


Asp 


Gin 


Ser 


Leu 


Lys 


Pro 


Cys 


Val 


Lys 


Leu 


Thr 
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85 90 95 

ccc ctg tgc gtg acc ctg caa tgc acc aag ctg aag cag age acc gac 336 
Pro Leu Cys Val Thr Leu Gin Cys Thr Lys Leu Lys Gin Ser Thr Asp 
100 105 110 

acc cag aac acc cgc tgg ggc acc cag gag atg aag aac tgc age ttc 384 
Thr Gin Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 120 125 

aac ate age ace age gtg cgc aac aag atg aag cgc gag tae gee ctg 432 
Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 
130 135 140 

ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tae 480 
Phe Tyr Ser Leu Asp lie Val Pro lie Asp Asn Asp Asn Thr Ser Tyr 
145 150 . 155 160 

cgc ctg cgc age tge aac aca teg ate ate ace eag gee tgc eee aag 528 
Arg Leu Arg Ser Cys Asn Thr Ser lie lie Thr Gin Ala Cys Pro Lys 
165 170 175 

gtg age ttc gag ccc ate ccc ate cac ttc tgc gcc ccc gcc ggc ttc 57 6 

Val Ser Phe Glu Pro lie Pro lie His Phe Cys Ala Pro Ala Gly Phe 
180 185 190 

gcc ate ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 624 
Ala lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 
195 200 205 

ace aac gtg age ace gtg cag tgc ace eac gga att cgc eee gtg gtg 672 
Thr Asn Val Ser Thr Val Gin Cys Thr His Gly lie Arg Pro Val Val 
210 215 220 

age acc cag ctg ctg ctg aac ggc age etg gee gag gag gag gtg gtg 720 
Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
225 230 235 240 

ate aga tct gag aac ttc acc aac aac gcc aag acc ate ate gtg cag 768 
lie Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr lie lie Val Gin 
245 250 255 

ctg aac gag age gtg gag ate aac tgc ace cgc eee aac aac aac acc 816 
Leu Asn Glu Ser Val Glu lie Asn Cys Thr Arg Pro Asn Asn Asn Thr 
260 265 270 

ege aag age ate cae ate ggc eet gge ege gee ttc tae ace ace ggc 864 
Arg Lys Ser lie His lie Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 
275 280 285 

gac ate ate gge gac ate cgc cag gee cae tge aac ate tct aga acc 912 
Asp lie lie Gly Asp lie Arg Gin Ala His Cys Asn lie Ser Arg Thr 
290 295 300 

aac tgg acc aac acc ctg aag cgc gtg gcc gag aag ctg cgc gag aag 960 
Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys 
305 310 315 320 



ttc aac aac acc ace ate gtg ttc aac eag age tec ggc ggc gac ccc 



1008 
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Phe Asn Asn Thr Thr lie Val Phe Asn Gin Ser Ser Gly Gly Asp Pro 

325 330 335 

gag ate gtg atg cac age tte aae tge ggc ggc gag ttc tte tae tgc 1056 

Glu lie Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 

340 345 350 

aac acc acc cag ctg ttc aac age ace tgg aae gag aee aae age gag 1104 

Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu 

355 360 365 

ggc aac ate act agt ggc acc ate acc ctg cec tge egc ate aag cag 1152 

TS±y Asn lie Thr Ser Gly Thr lie Thir-Leti Pro Cys Arg lie Lys Gin 

370 375 380 

ate ate aac atg tgg cag gag gtg ggc aag gee atg tac gee ecc eee 1200 

lie lie Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 

385 390 395 400 

ate ggc ggc eag ate aag tge ctg age aae ate ace ggc ctg ctg ctg 1248 

lie Gly Gly Gin lie Lys Cys Leu Ser Asn lie Thr Gly Leu Leu Leu 

405 410 415 

acc egc gae ggc ggc age gac aac teg ag 1277 

Thr Arg Asp Gly Gly Ser Asp Asn Ser 

420 425 



<210> 48 
<211> 425 
<212> PRT 

<213> Artificial Sequence 
<400> 48 



Ala 


Ser 


Ala 


Ala 


Asp 


Arg 


Leu 


Trp 


Val 


Thr 


Val 


Tyr 


Tyr 


Gly 


Val 


Pro 


1 








5 










10 










15 




Val 


Trp 


Lys 


Asp 


Ala 


Thr 


Thr 


Thr 


Leu 


Phe 


Cys 


Ala 


Ser 


Asp 


Ala 


Lys 








20 










25 










30 






Ala 


Tyr 


Asp 


Thr 


Glu 


Val 


His 


Asn 


Val 


Trp 


Ala 


Thr 


His 


Ala 


Cys 


Val 






35 










40 










45 








Pro 


Thr 


Asp 


Pro 


Asn 


Pro 


Gin 


Glu 


Val 


Val 


Leu 


Gly Asn 


Val 


Thr 


Glu 




50 










55 










60 










Asn 


Phe 


Asn 


Met 


Gly 


Lys 


Asn 


Asn 


Met 


Val 


Glu 


Gin 


Met 


His 


Glu 


Asp 


65 










70 










75 










80 


He 


He 


Ser 


Leu 


Trp 


Asp 


Gin 


Ser 


Leu 


Lys 


Pro 


Cys 


Val 


Lys 


Leu 


Thr 










85 










90 










95 




Pro 


Leu 


Cys 


Val 


Thr 


Leu 


Gin 


Cys 


Thr 


Lys 


Leu 


Lys 


Gin 


Ser 


Thr 


Asp 








100 










105 










110 






Thr 


Gin 


Asn 


Thr 


Arg 


Trp 


Gly 


Thr 


Gin 


Glu 


Met 


Lys 


Asn 


Cys 


Ser 


Phe 






115 










120 










125 








Asn 


He 


Ser 


Thr 


Ser 


Val 


Arg 


Asn 


Lys 


Met 


Lys 


Arg 


Glu 


Tyr 


Ala 


Leu 




130 










135 










140 










Phe 


Tyr 


Ser 


Leu 


Asp 


He 


Val 


Pro 


He 


Asp 


Asn 


Asp 


Asn 


Thr 


Ser 


Tyr 


145 










150 










155 










160 


Arg 


Leu 


Arg 


Ser 


Cys 


Asn 


Thr 


Ser 


He 


He 


Thr 


Gin 


Ala 


Cys 


Pro 


Lys 










165 










170 










175 




Val 


Ser 


Phe 


Glu 


Pro 


He 


Pro 


He 


His 


Phe 


Cys 


Ala 


Pro 


Ala 


Gly 


Phe 








180 










185 










190 






Ala 


He 


Leu 


Lys 


Cys 


Asn 


Asn 


Lys 


Thr 


Phe 


Asn 


Gly 


Thr 


Gly 


Pro 


Cys 
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X y D 




















9ns 








rp U, v~ 


Asn 


V ax 


Ser 


Tnir 


V a X 


Ljxn 


v^y s 


i nr 


HXS 


Lixy 


X X 


i-ix y 


IT J_ W 


V;=i 1 
V d J. 


V d X 




9 1 n 

^ X u 










215 










220 










IT 


i nir 


Lixn 


Leu 


Leu 


Leu 


Asn 


vjxy 


Ser 


Leu 


riXa 


OX Li 


ox Ll 




Val 


Val 


9 9 R 










9 

^ o u 










9 "^S 
^ «j .J 










240 




7\ >" 


oe j_ 


vjxu 


Asn 


irne 


i I IX 


7\ C? T~l 

M.O 11 


7\ o n 


i-iXcl 


Lys 


Thr- 


He 


He 


Val 


Gin 










9 4 S 










250 










255 






/-io li 


oX U 


Q 


V 3X 


X u. 


X X 


7\ c: -n 
/-io 1 1 




J. I IX 


7\ 7" 
i^X y 


IT J. W 


r^o 1 i 


A «? n 

11 


Asn 


Thr 








9 

^ D U 










9 S 










270 








s 


oe j_ 


X X e 


rixs 


X xe 


vjxy 


IT X %_J 


oxy 


jtIX y 


r^X ct 


Phe 


1 yr 


Thr 


Thr 


Gly 






97 R 










9 RO 
^ o u 










^ O 










X X c 


T 1 

X xt? 


kjxy 




X X fc; 




m n 

V3 J. ii 




His 


Cys 


Asn 


He 


Ser 


Arg 


Thr 




9 on 










9 Q^^ 




















Asn 


Trp 


1 niT 


Asn 


J. niT 


Leu 


Lys 


Arg 


V aX 


x-iXct 


ox u. 


Lys 


Leu 


Arg 


Glu 


j-ty s 


J u o 










O X u 










o J. o 










320 


irne 


Asn 


Asn 


i nx 


1 nx 


X xe 


V aX 


irne 


Asn 


oxn 


oex 


OCT X 


vjxy 


Gly Asp 


IT X \J 










*^9 R 










•J o u 










335 




LiXU. 


± ±e 


va.x 


MeL. 


rlxS 


Ser 


Irne 


Asn 


Cys 


vjX y 


^x y 


oX u. 


iri Its 


Phe 


Tyr 


^^y o 








d n 

O 4 u 




















350 






Asn 


i nir 


i nir 


taXn 


Leu 


irne 


Asn 


C v~ 

oer 


i nx 


irp 


Zi o r-k 


*cjX LI 


1 1 IX 


Asn 


Ser 


^jX LI 






355 










360 










365 








Gly 


Asn 


He 


Thr 


Ser 


Gly 


Thr 


He 


Thr 


Leu 


Pro 


Cys 


Arg 


He 


Lys 


Gin 




370 










375 










380 










lie 


He 


Asn 


Met 


Trp 


Gin 


Glu 


Val 


Gly 


Lys 


Ala 


Met 


Tyr 


Ala 


Pro 


Pro 


385 










390 










395 










400 


He 


Gly 


Gly 


Gin 


He 


Lys 


Cys 


Leu 


Ser 


Asn 


He 


Thr 


Gly 


Leu 


Leu 


Leu 










405 










410 










415 




Thr 


Arg 


Asp 


Gly 


Gly 


Ser 


Asp 


Asn 


Ser 

















420 425 



<210> 49 
<211> 144 
<212> PRT 

<213> Artificial Sequence 





<400> 


get 


age 


gcg 


gtg 


tgg 


aag 


gcc 


tac 


gac 


ccc 


acc 


gac 


aac 


ttc 


aac 


ate 


ate 


age 


ccc 


ctg 


tge 


acc 


cag 


aac 


cag 


ate 


age 


ttc 


tac 


age 


cgc 


ctg 


cgc 


gtg 


age 


ttc 


gcc 


ate 


ctg 


acc 


aac 


gtg 


ago 


acc 


cag 


ate 


aga 


tet 


ctg 


aac 


gag 


cgc 


aag 


age 


gac 


ate 


ate 


aac 


tgg 


acc 


ttc 


aac 


aac 


gag 


ate 


gtg 


aac 


acc 


ace 



49 






gcc 


gac 


cgc 


gac 


gee 


acc 


acc 


gag 


gtg 


ccc 


aac 


ccc 


atg 


ggc 


aag 


ctg 


tgg 


gac 


gtg 


acc 


ctg 


acc 


cgc 


tgg 


acc 


age 


gtg 


ctg 


gac 


ate 


age 


tge 


aac 


gag 


ccc 


ate 


aag 


tge 


aac 


age 


acc 


gtg 


ctg 


ctg 


ctg 


gag 


aac 


ttc 


age 


gtg 


gag 


ate 


eac 


ate 


ggc 


gac 


ate 


aac 


ace 


ctg 


ace 


ace 


ate 


atg 


cac 


age 


cag 


ctg 


ttc 



ctg tgg gtg 
ace acc ctg 
cac aac gtg 
cag gag gtg 
aac aac atg 
cag age ctg 
caa tge acc 
ggc acc cag 
cgc aac aag 
gtg ccc ate 
aca teg ate 
ccc ate cac 
aac aag ace 
cag tge acc 
aac ggc age 
ace aac aac 
ate aac tge 
ggc cct ggc 
cgc cag gcc 
aag cgc gtg 
gtg ttc aac 
ttc aac tge 
aac age ace 



ace gtg tac 
ttc tge gcc 
tgg gee ace 
gtg ctg ggc 
gtg gag cag 
aag ccc tge 
aag ctg aag 
gag atg aag 
atg aag cgc 
gac aac gac 
ate ace cag 
ttc tge gcc 
ttc aac ggc 
cac gga att 
ctg gcc gag 
gcc aag ace 
acc cgc ccc 
cgc gcc ttc 
cac tge aac 
gee gag aag 
cag age tec 
ggc ggc gag 
tgg aac gag 



tac ggc gtg 
age gac gcc 
cac gcg tge 
aac gtg ace 
atg eac gag 
gtg aag ctg 
cag age acc 
aac tge age 
gag tac gee 
cag acc age 
gee tge ccc 
ccc gcc ggc 
acc ggc ccc 
cgc ccc gtg 
gag gag gtg 
ate ate gtg 
aac aac aac 
tac acc acc 
ate tct aga 
ctg cgc gag 
ggc ggc gac 
ttc ttc tac 
acc aac age 



ccc 48 

aag 96 

gtg 144 

gag 192 

gat 240 

acc 288 

gac 336 

ttc 384 

ctg 432 

tac 480 

aag 528 

ttc 576 

tge 624 

gtg 672 

gtg 720 

cag 768 

acc 816 

ggc 864 

acc 912 

aag 960 

ccc 1008 

tge 1056 

gag 1104 
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ggc 


aac 


ate 


act 


aat 


acre 


aee 


ate 


aee 


etg 


ecc 


tgc 


cgc 


ate 


aag 


eag 


ate 


ate 


aae 


atg 


tgg 


eag 


gag 


gtg 


ggc 


aag 


gee 


atg 


t ac 


gcc 


ecc 


ecc 


ate 


gge 


gge 


cag 


ate 


aag 


tgc 


etg 


age 


aae 


ate 


acc 


ggc 


etg 


etg 


etg 


acc 


cgc 


gac 


gge 


ggc 


age 


gae 


aae 


teg 


ao 
















<210> 


50 




























<211> 


425 




























<212> 


PRT 




























<213> 


Artificial : 


Sequence 




















<400> 


50 


























Ala 


Ser 


Ala 


Ala 


Asp 


Arg 


Leu 


Trp 


Val 


Thr 


Val 


Tyr 


Tyr 


Gly 


Val 


Pro 










5 










10- 










15 




Val 


Trp 


Lys 


Asp Ala 


Thr 


Thr 


Thr 


Leu 


Phe 


Cys 


Ala 


Ser 


Asp 


Ala 


Lys 








20 










25 










30 






Ala 


Tyr 


Asp 


Thr 


Glu 


Val 


His 


Asn 


Val 


Trp 


Ala 


Thr 


His 


Ala 


Cys 


Val 






35 










40 










45 






Pro 


Thr 


Asp 


Pro 


Asn 


Pro 


Gin 


Glu 


Val 


Val 


Leu 


Gly Asn 


Val 


Thr 


Glu 




50 










55 










60 










Asn 


Phe 


Asn 


Met 


Gly 


Lys 


Asn 


Asn 


Met 


Val 


Glu 


Gin 


Met 


His 


Glu 


Asp 


65 










70 










75 










80 


He 


He 


Ser 


Leu 


Trp 


Asp 


Gin 


Ser 


Leu 


Lys 


Pro 


Cys 


Val 


Lvs 


Leu 


Thr 










85 










90 










95 




Pro 


Leu 


Cys 


Val 


Thr 


Leu 


Gin 


Cys 


Thr 


Lvs 
J. 


Leu 


Lys 


Gin 


Ser 


Thr 


Asp 








100 










105 










110 




Thr 


Gin 


Asn 


Thr 


Arg 


Trp 


Gly 


Thr 


Gin 


Glu 


Met 


Lys 


Asn 


Cys 


Ser 


Phe 






115 










120 










125 








Gin 


He 


Ser 


Thr 


Ser 


Val 


Arg 


Asn 


Lys 


Met 


Lys 


Arg 


Glu 


Tvr 


Ala 


Leu 




130 










135 










140 










Phe 


Tyr 


Ser 


Leu Asp 


He 


Val 


Pro 


He 


Asp 


Asn 


Asp 


Gin 


Thr 


Ser 


Tvr 


145 










150 










155 










160 


Arg 


Leu 


Arg 


Ser 


Cys 


Asn 


Thr 


Ser 


He 


He 


Thr 


Gin 


Ala 


Cys 


Pro 


Lys 










165 










170 










175 




Val 


Ser 


Phe 


Glu 


Pro 


He 


Pro 


He 


His 


Phe 


Cys 


Ala 


Pro 


Ala 


Glv 


Phe 








180 










185 










190 




Ala 


He 


Leu 


Lys 


Cys 


Asn 


Asn 


Lys 


Thr 


Phe 


Asn 


Gly 


Thr 


Gly 


Pro 


Cys 






195 










200 










205 








Thr 


Asn 


Val 


Ser 


Thr 


val 


(jjln 


Cys 


Thr 


His 


Gly 


He 


Arg 


Pro 


Val 


Val 




210 










215 










220 










Ser 


Thr 


Gin 


Leu 


Leu 


Leu 


Asn 


Gly 


Ser 


Leu 


Ala 


Glu 


Glu 


Glu 


Val 


Val 


225 










O O A 










235 










240 


He 


Arg 


Ser 


Glu 


Asn 


Fne 


Thr 


Asn 


Asn 


Ala 


Lys 


Thr 


He 


He 


Val 


Gin 










245 










250 










255 




Leu 


Asn 


Glu 


Ser 


Val 


Glu 


He 


Asn 


Cys 


Thr 


Arg 


Pro 


Asn 


Asn 


Asn 


Thr 








260 










265 










270 






Arg 


Lys 


Ser 


He 


His 


He 


Gly 


Pro 


Gly Arg 


Ala 


Phe 


Tyr 


Thr 


Thr 


Gly 






275 










280 










285 








Asp 


He 


He 


Gly Asp 


He 


Arg 


Gin 


Ala 


His 


Cys 


Asn 


He 


Ser 


Arg 


Thr 




290 










295 










300 










Asn 


Trp 


Thr 


Asn 


Thr 


Leu 


Lys 


Arg 


Val 


Ala 


Glu 


Lys 


Leu 


Arg 


Glu 


Lvs 


305 










310 










315 










320 


Phe 


Asn 


Asn 


Thr 


Thr 


He 


Val 


Phe 


Asn 


Gin 


Ser 


Ser 


Gly 


Gly 


Asp 


Pro 










325 










330 










335 




Glu 


He 


Val 


Met 


His 


Ser 


Phe 


Asn 


Cys 


Gly 


Gly 


Glu 


Phe 


Phe 


Tyr 


Cys 








340 










345 










350 






Asn 


Thr 


Thr 


Gin 


Leu 


Phe 


Asn 


Ser 


Thr 


Trp 


Asn 


Glu 


Thr 


Asn 


Ser 


Glu 






355 










360 










365 








Gly 


Asn 


He 


Thr 


Ser 


Gly 


Thr 


He 


Thr 


Leu 


Pro 


Cys 


Arg 


He 


Lys 


Gin 




370 










375 










380 











1152 
1200 
1248 
1277 
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lie lie Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 

385 390 395 400 

lie Gly Gly Gin lie Lys Cys Leu Ser Asn lie Thr Gly Leu Leu Leu 

405 410 415 

Thr Arg Asp Gly Gly Ser Asp Asn Ser 
420 425 



<210> 51 
<211> 144 
<212> PRT 
<213> Artificial 



Sequence 





<400> 


51 






get 


age 


gcg 


gcc 


gac 


cgc 


gtg 




aag 


gac 


gee 


acc 


gcc 


L. dv^ 


gac 


acc 


y dy 


y *-y 


ccc 


acc 


ga c 


ccc 


aac 


CCC 


a 


ttc 


aac 


at g 






ate 


ate 


age 


c t g 


'-y y 


gac 


ccc 


ctg 


t gc 


y «-y 


acc 


ctg 


acc 


aac 


aac 


ace 


cgc 




cag 


ate 


age 


acc 


age 


gtg 


ttc 


tac 


age 


ctg 


gac 


ate 


cgc 


ctg 


cgc 


age 


tgc 


aac 


gtg 


age 


ttc 


gag 


ccc 


ate 


gcc 


ate 


ctg 


aag 


tgc 


aac 


acc 


aac 


gtg 


age 


acc 


gtg 


age 


ace 


cag 


ctg 


ctg 


ctg 


ate 


aga 


tct 


gag 


aac 


ttc 


ctg 


aac 


gag 


age 


gtg 


gag 


cgc 


aag 


age 


ate 


cac 


ate 


gac 


ate 


ate 


ggc 


gac 


ate 


aac 


tgg 


ace 


aac 


ace 


ctg 


ttc 


aac 


aac 


acc 


acc 


ate 


gag 


ate 


gtg 


atg 


cac 


age 


aac 


acc 


acc 


cag 


ctg 


ttc 


gge 


aac 


ate 


act 


agt 


ggc 


ate 


ate 


aac 


atg 


tgg 


cag 


ate 


ggc 


ggc 


cag 


ate 


aag 


acc 


cgc 


gac 


ggc 


ggc 


age 



<210> 52 
<211> 425 
<212> PRT 
<213> Artificial 



ctg 


•-y y 


y '-y 


acc 


y '-y 


tac 


acc 


acc 


ct g 


ttc 


tgc 


gcc 


cac 


aac 


Qtg 


tgg 


gcc 


acc 


cag 


gag 


guy 


y "-y 


^ uy 


y y ^ 


aac 


aac 


atg 


y '-y 


y «=^y 


cag 


o o y 


age 


ctg 


aag 


ccc 


tgc 


aac 


u y o 


acc 


aag 


ctg 


aag 


ggc 


acc 


cag 


gag 


atg 


aag 


cgc 


aac 


aag 


atg 


aag 


cgc 


gtg 


ccc 


ate 


gac 


aac 


gac 


aca 


teg 


ate 


ate 


acc 


cag 


ccc 


ate 


cac 


ttc 


tgc 


gcc 


aac 


aag 


acc 


ttc 


aac 


ggc 


cag 


tgc 


acc 


cac 


gga 


att 


aac 


ggc 


age 


ctg 


gcc 


gag 


acc 


aac 


aac 


gcc 


aag 


acc 


ate 


aac 


tgc 


acc 


cgc 


ccc 


ggc 


cet 


ggc 


cgc 


gcc 


ttc 


cgc 


cag 


gcc 


cac 


tgc 


aac 


aag 


cgc 


gtg 


gcc 


gag 


aag 


gtg 


ttc 


aac 


cag 


age 


tec 


ttc 


aac 


tgc 


ggc 


gge 


gag 


aac 


age 


acc 


tgg 


aac 


gag 


acc 


ate 


acc 


ctg 


ccc 


tgc 


gag 


gtg 


ggc 


aag 


gcc 


atg 


tgc 


ctg 


age 


aac 


ate 


acc 


gac 


aac 


teg 


ag 







ence 



tac 


QCC 

y y ^ 


at a 


ccc 


48 


age 


gac 


gee 


aag 


96 


cac 


y ^y 


tgc 


at a 

y u y 


14 4 


aac 


y "-y 


acc 


aaa 

y ay 


192 


atg 


cac 


rr A rr 
y dy 


gat 


240 


at a 


aag 


ctg 


acc 


288 


aac 


age 


acc 


gac 


336 


aac 


tgc 


age 


ttc 


384 


gag 


tac 


gcc 


ctg 


432 


cag 


acc 


age 


tac 


480 


gee 


tgc 


ccc 


aag 


528 


ccc 


gee 


ggc 


ttc 


576 


acc 


ggc 


ccc 


tgc 


624 


cgc 


ccc 


gtg 


gtg 


672 


gag 


gag 


gtg 


gtg 


720 


ate 


ate 


gtg 


cag 


768 


aac 


aac 


aac 


acc 


816 


tac 


acc 


acc 


ggc 


864 


ate 


tct 


aga 


acc 


912 


ctg 


cgc 


gag 


aag 


960 


ggc 


ggc 


gac 


ccc 


1008 


ttc 


ttc 


tac 


tgc 


1056 


acc 


aac 


age 


gag 


1104 


cgc 


ate 


aag 


cag 


1152 


tac 


gcc 


ccc 


ccc 


1200 


ggc 


ctg 


ctg 


ctg 


1248 



1277 



<400> 52 
Ala Ser Ala Ala 
1 

Val Trp Lys Asp 

20 

Ala Tyr Asp Thr 
35 

Pro Thr Asp Pro 
50 

Asn Phe Asn Met 
65 

lie lie Ser Leu 



Asp Arg Leu Trp 
5 

Ala Thr Thr Thr 

Glu Val His Asn 
40 

Asn Pro Gin Glu 
55 

Gly Lys Asn Asn 
70 

Trp Asp Gin Ser 
85 



Val Thr Val Tyr 
10 

Leu Phe Cys Ala 

25 

Val Trp Ala Thr 

Val Val Leu Gly 
60 

Met Val Glu Gin 
75 

Leu Lys Pro Cys 
90 



Tyr Gly Val Pro 
15 

Ser Asp Ala Lys 

30 

His Ala Cys Val 
45 

Asn Val Thr Glu 

Met His Glu Asp 
80 

Val Lys Leu Thr 
95 
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Pro Leu Cys Val 
100 

Thr Asn Asn Thr 

115 

Gin lie Ser Thr 
130 

Phe Tyr Ser Leu 
145 

Arg Leu Arg Ser 

Val Ser Phe Glu 
180 

•ft±a lie Leu Lys 
195 

Thr Asn Val Ser 
210 

Ser Thr Gin Leu 
225 

lie Arg Ser Glu 

Leu Asn Glu Ser 
260 

Arg Lys Ser lie 
275 

Asp lie lie Gly 
2 90 

Asn Trp Thr Asn 
305 

Phe Asn Asn Thr 

Glu lie Val Met 
340 

Asn Thr Thr Gin 
355 

Gly Asn lie Thr 

370 

lie lie Asn Met 
385 

lie Gly Gly Gin 

Thr Arg Asp Gly 
420 



Thr Leu Asn Cys 

Arg Trp Gly Thr 
120 

Ser Val Arg Asn 
135 

Asp lie Val Pro 
150 

Cys Asn Thr Ser 
165 

Pro lie Pro lie 

Cys Asn Asn Lys 

200 

Thr Val Gin Cys 
215 

Leu Leu Asn Gly 
230 

Asn Phe Thr Asn 
245 

Val Glu lie Asn 

His lie Gly Pro 
280 

Asp lie Arg Gin 

295 

Thr Leu Lys Arg 
310 

Thr He Val Phe 
325 

His Ser Phe Asn 

Leu Phe Asn Ser 
360 

Ser Gly Thr He 

375 

Trp Gin Glu Val 
390 

He Lys Cys Leu 
405 

Gly Ser Asp Asn 



Thr Lys Leu Lys 
105 

Gin Glu Met Lys 

Lys Met Lys Arg 
140 

He Asp Asn Asp 
155 

He He Thr Gin 
170 

His Phe Cys Ala 
185 

Thr— PhB Asn Gly 

Thr His Gly He 
220 

Ser Leu Ala Glu 
235 

Asn Ala Lys Thr 

250 

Cys Thr Arg Pro 
265 

Gly Arg Ala Phe 

Ala His Cys Asn 

300 

Val Ala Glu Lys 
315 

Asn Gin Ser Ser 
330 

Cys Gly Gly Glu 
345 

Thr Trp Asn Glu 

Thr Leu Pro Cys 

380 

Gly Lys Ala Met 
395 

Ser Asn He Thr 
410 

Ser 
425 



Asn Ser Thr Asp 
110 

Asn Cys Ser Phe 

125 

Glu Tyr Ala Leu 

Gin Thr Ser Tyr 
160 

Ala Cys Pro Lys 
175 

Pro Ala Gly Phe 
190 

Thr Gly Pro Cys 

205 

Arg Pro Val Val 

Glu Glu Val Val 
240 

He He Val Gin 
255 

Asn Asn Asn Thr 
270 

Tyr Thr Thr Gly 
285 

He Ser Arg Thr 

Leu Arg Glu Lys 
320 

Gly Gly Asp Pro 
335 

Phe Phe Tyr Cys 
350 

Thr Asn Ser Glu 
365 

Arg He Lys Gin 

Tyr Ala Pro Pro 
400 

Gly Leu Leu Leu 
415 



<210> 53 
<211> 432 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . , , (432) 



<400> 53 

tct aga gcg eta cct cca gga cca gcg ctt cct ggg cat gtg ggg ctg 4 8 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 

15 10 15 



etc egg caa get gat ctg cac cac ggc cgt gee ctg gaa cgc eag ctg 
Leu Arg Gin Ala Asp Leu His His Gly Arg Ala Leu Glu Arg Gin Leu 



96 
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20 25 30 

gag caa caa gaa cct gag cca gat ttg gga caa cat gac ctg gat gga 14 4 

Glu Gin Gin Glu Pro Glu Pro Asp Leu Gly Gin His Asp Leu Asp Gly 
35 40 45 



gtg gga gcg cga gat cag caa eta cac cga gat cat eta cag cct gat 192 

Val Gly Ala Arg Asp Gin Gin Leu His Arg Asp His Leu Gin Pro Asp 

50 55 60 

cga gga gag cca gaa cca gca gga gaa gaa cga get gga cct get cca 240 

Arg Gly Glu Pro Glu Pro Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro 

70 75 80 

get gga caa gtg ggc aag ett gtg gaa ctg gtt caa cat cac caa ctg 288 

Ala Gly Gin Val Gly Lys Leu Val Glu Leu Val Gin His His Gin Leu 

85 90 95 

get gtg gta cat caa gat ttt cat cat gat cgt ggg egg cct gat egg 336 

Ala Val Val His Gin Asp Phe His His Asp Arg Gly Arg Pro Asp Arg 

100 105 110 

cct gcg cat cgt gtt cac cgt get gag eat cgt gaa ccg cgt gcg cca 384 

Pro Ala His Arg Val His Arg Ala Glu His Arg Glu Pro Arg Ala Pro 

115 120 125 

ggg eta cag ecc cct gag ett cca gac ccg cct gee cgt gee ccg egg 432 

Gly Leu Gin Pro Pro Glu Leu Pro Asp Pro Pro Ala Arg Ala Pro Arg 

130 135 140 



<210> 54 
<211> 144 
<212> PRT 

<213> Artificial Sequence 



<400> 54 



Ser 


Arg 


Ala 


Leu 


Pro 


Pro 


Gly 


Pro 


Ala 


Leu 


Pro 


Gly 


His 


Val 


Gly 


Leu 


1 








5 










10 










15 




Leu 


Arg 


Gin 


Ala 


Asp 


Leu 


His 


His 


Gly Arg 


Ala 


Leu 


Glu 


Arg 


Gin 


Leu 








20 










25 










30 






Glu 


Gin 


Gin 
35 


Glu 


Pro 


Glu 


Pro 


Asp 
40 


Leu 


Gly 


Gin 


His 


Asp 
45 


Leu 


Asp 


Gly 


Val 


Gly 
50 


Ala 


Arg 


Asp 


Gin 


Gin 
55 


Leu 


His 


Arg 


Asp 


His 
60 


Leu 


Gin 


Pro 


Asp 


Arg 


Gly 


Glu 


Pro 


Glu 


Pro 


Ala 


Gly 


Glu 


Glu 


Arg 


Ala 


Gly 


Pro 


Ala 


Pro 


65 










70 










75 










80 


Ala 


Gly 


Gin 


Val 


Gly 
85 


Lys 


Leu 


Val 


Glu 


Leu 
90 


Val 


Gin 


His 


His 


Gin 
95 


Leu 


Ala 


Val 


Val 


His 

100 


Gin 


Asp 


Phe 


His 


His 

105 


Asp 


Arg 


Gly 


Arg 


Pro 

110 


Asp 


Arg 


Pro 


Ala 


His 

115 


Arg 


Val 


His 


Arg 


Ala 
120 


Glu 


His 


Arg 


Glu 


Pro 
125 


Arg 


Ala 


Pro 


Gly 


Met 
130 


Gin 


Pro 


Pro 


Glu 


Leu 
135 


Pro 


Asp 


Pro 


Pro 


Ala 
140 


Arg 


Val 


Thr 


Asp 



<210> 

<211> 
<212> 



55 

434 

DNA 



wo 00/29561 PCT/DKOO/00144 

28 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (434) 

<400> 55 

tct aga gcg eta act cca gga cca gcg ctt cct ggg cat gtg ggg ctg 4 8 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 
15 10 15 

etc egg caa get gat ctg cae eac gge cgt gee ctg gaa cge cag ctg 96 
-beu Arg Gin Ala Asp Leu His His Gty— Arg Ala Leu Glu Arg Gin Leu 
20 25 30 

gag caa caa gaa eet gag cca gat ttg gga caa cat gac ctg gat gga 14 4 

Glu Gin Gin Glu Pro Glu Pro Asp Leu Gly Gin His Asp Leu Asp Gly 
35 40 45 

gtg gga gcg cga gat cag caa eta cac cga gat cat eta cag cct gat 192 
Val Gly Ala Arg Asp Gin Gin Leu His Arg Asp His Leu Gin Pro Asp 

50 55 60 

cga gga gag cca gaa cca gca gga gaa gaa cga get gga cct get cca 24 0 

Arg Gly Glu Pro Glu Pro Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro 
65 70 75 80 

get gga caa gtg gge aag ctt gtg gaa ctg gtt caa eat cac caa ctg 288 
Ala Gly Gin Val Gly Lys Leu Val Glu Leu Val Gin His His Gin Leu 
85 90 95 

get gtg gta cat caa gat ttt cat cat gat cgt ggg egg cct gat egg 336 
Ala Val Val His Gin Asp Phe His His Asp Arg Gly Arg Pro Asp Arg 
100 105 110 

cct gcg cat cgt gtt cac cgt get gag cat cgt gaa ccg cgt gcg cca 384 
Pro Ala His Arg Val His Arg Ala Glu His Arg Glu Pro Arg Ala Pro 
115 120 125 

ggg atg cag ccc cct gag ctt cca gac ccg cct gee cgt gtg aeg gat 432 
Gly Met Gin Pro Pro Glu Leu Pro Asp Pro Pro Ala Arg Val Thr Asp 
130 135 . 140 

cc 434 

<210> 56 
<211> 144 

<212> PRT 

<213> Artificial Sequence 
<400> 56 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 

15 10 15 

Leu Arg Gin Ala Asp Leu His His Gly Arg Ala Leu Glu Arg Gin Leu 

20 25 30 

Glu Gin Gin Glu Pro Glu Pro Asp Leu Gly Gin His Asp Leu Asp Gly 

35 40 45 

Val Gly Ala Arg Asp Gin Gin Leu His Arg Asp His Leu Gin Pro Asp 
50 55 60 
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Arq 


Glv 

x_j J. jr 


Glu 


Pro 


Glu 


Pro 


Ala 


Gly 


Glu 


Glu 


Arg 


Ala 


Gly 


Pro 


Ala 


Pro 


65 










70 










75 










80 


Ala 


Glv 


Gin 


Val 


Gly 
85 


Lys 


Leu 


Val 


Glu 


Leu 

90 


Val 


Gin 


His 


His 


Gin 

95 


Leu 


Ala 


Val 


Val 


His 
100 


Gin 


Asp 


Phe 


His 


His 
105 


Asp 


Arg 


Gly 


Arg 


Pro 
110 


Asp 


Arg 


Pro 


Ala 


His 
115 


Arg 


Val 


His 


Arg 


Ala 
120 


Glu 


His 


Arg 


Glu 


Pro 
125 


Arg 


Ala 


Pro 


Gly 


Met 
130 


Gin 


Pro 


Pro 


Glu 


Leu 
135 


Pro 


Asp 


Pro 


Pro 


Ala 
140 


Arg 


Val 


Thr 


Asp 



<210> 57 
<211> 281 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (281) 

<400> 57 

tct aga gcg eta cct cca gga cca gcg ctt cct ggg cat gtg ggg ctg 4 8 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 
15 10 15 

etc egg caa get gat ctg eac cac gge egt gee ctg gaa cgc cag ctg 96 
Leu Arg Gin Ala Asp Leu His His Gly Arg Ala Leu Glu Arg Gin Leu 
20 25 30 

gag caa caa gaa cct gag cca gat ttg gga caa cat gac ctg gat gga 14 4 

Glu Gin Gin Glu Pro Glu Pro Asp Leu Gly Gin His Asp Leu Asp Gly 
35 40 45 

gtg gga gcg cga gat cag caa eta cac cga gat cat eta cag cct gat 192 
Val Gly Ala Arg Asp Gin Gin Leu His Arg Asp His Leu Gin Pro Asp 
50 55 60 



cga gga gag cca gaa cca gca gga gaa gaa cga get gga ect get cca 24 0 

Arg Gly Glu Pro Glu Pro Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro 

65 70 75 80 

get gga caa gtg gge aag ctt gtg tga ctg att gag gat ee 281 

Ala Gly Gin Val Gly Lys Leu Val * Leu lie Glu Asp 
85 90 





<210> 


58 












<211> 


92 












<212> 


PRT 












<213> 


Artificial i 


Sequence 




<400> 


58 










Ser 


Arg Ala 


Leu 


Pro 


Pro 


Gly 


Pro 


1 






5 








Leu 


Arg Gin 


Ala 


Asp 


Leu 


His 


His 






20 










Glu 


Gin Gin 


Glu 


Pro 


Glu 


Pro 


Asp 




35 










40 


Val 


Gly Ala 


Arg 


Asp 


Gin 


Gin 


Leu 



Ala 


Leu 


Pro 


Gly 


His 


Val 


Gly 


Leu 




10 










15 




Gly 


Arg 


Ala 


Leu 


Glu 


Arg 


Gin 


Leu 


25 










30 






Leu 


Gly 


Gin 


His 


Asp 


Leu 


Asp 


Gly 










45 








His 


Arg 


Asp 


His 


Leu 


Gin 


Pro 


Asp 
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50 55 60 

Arg Gly Glu Pro Glu Pro Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro 
65 70 75 80 

Ala Gly Gin Val Gly Lys Leu Val Leu lie Glu Asp 
85 90 

<210> 59 
<211> 272 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (272) 

<400> 59 

ate gat tgc gcg ace tgc tgc tga teg tgg eee gca teg tgg age tge 48 

lie Asp Cys Ala Thr Cys Cys * Ser Trp Pro Ala Ser Trp Ser Cys 
15 10 15 

tgg gee ggc gcg get ggg aga tec tga agt act ggt gga acc tgc tec 96 
Trp Ala Gly Ala Ala Gly Arg Ser * Ser Thr Gly Gly Thr Cys Ser 
20 25 30 

agt act gga gcc agg age tga aga act ctg cag tga gcc tgc tga acg 14 4 

Ser Thr Gly Ala Arg Ser * Arg Thr Leu Gin * Ala Cys * Thr 

35 40 

cea ecg eca teg eeg tgg ccg agg gca ccg acc gcg tga teg agg tgg 192 
Pro Pro Pro Ser Pro Trp Pro Arg Ala Pro Thr Ala * Ser Arg Trp 
45 50 55 

tge age gca tct ggc gcg gea tee tge aea tec cea eee gaa ttc gee 240 
Cys Ser Ala Ser Gly Ala Ala Ser Cys Thr Ser Pro Pro Glu Phe Ala 
60 65 70 

agg get teg age gcg eee tge tgt aag gat cc 272 
Arg Ala Ser Ser Ala Pro Cys Cys Lys Asp 
75 80 



<210> 60 

<211> 84 
<212> PRT 

<213> Artificial Sequence 



<400> 60 



He 


Asp 


Cys 


Ala 


Thr 


Cys 


Cys 


Ser 


Trp 


Pro 


Ala 


Ser 


Trp 


Ser 


Cys 


Trp 


1 








5 










10 










15 


Ala 


Gly Ala 


Ala 


Gly 


Arg 


Ser 


Ser 


Thr 


Gly 


Gly 


Thr 


Cys 


Ser 


Ser 


Thr 








20 










25 










30 






Gly 


Ala 


Arg 


Ser 


Arg 


Thr 


Leu 


Gin 


Ala 


Cys 


Thr 


Pro 


Pro 


Pro 


Ser 


Pro 






35 










40 










45 








Trp 


Pro 


Arg 


Ala 


Pro 


Thr 


Ala 


Ser 


Arg 


Trp 


Cys 


Ser 


Ala 


Ser 


Gly 


Ala 




50 










55 










60 








Ala 


Ser 


Cys 


Thr 


Ser 


Pro 


Pro 


Glu 


Phe 


Ala 


Arg 


Ala 


Ser 


Ser 


Ala 


Pro 


65 










70 










75 










80 


Cys 


Cys 


Lys 


Asp 
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<210> 61 
<211> 798 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (798) 

<400> 61 

etc gag cag egg caa gga gat ttt ccg cec egg egg egg cga eat gcg 4 8 

i:;eti Glu Gin Arg Gin Gly Asp Phe PtirVrb Arg Arg Arg Arg His Ala 
15 10 15 

cga caa ctg gcg cag cga get gta caa gta caa ggt ggt gaa gat cga 96 
Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 
20 25 30 

gee cct ggg cat cgc cec cac caa ggc caa gcg ccg cgt ggt gca gcg 144 
Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 
35 40 45 

cga gaa gcg ege cgt ggg eat egg cgc tat gtt eet egg ctt ect ggg 192 
Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 
50 55 60 

cgc tgc agg cag cac cat ggg ege cgc cag ect gac cct gae cgt gea 240 
Arg Cys Arg Gin His His Gly Arg Arg Gin Pro Asp Pro Asp Arg Ala 
65 70 75 80 

ggc ccg cca get get gag egg eat cgt gca gca gea gaa caa cct get 288 
Gly Pro Pro Ala Ala Glu Arg His Arg Ala Ala Ala Glu Gin Pro Ala 
85 90 95 

gcg cgc cat cga ggc cca gca gca cct get cca get gac cgt gtg ggg 336 
Ala Arg His Arg Gly Pro Ala Ala Pro Ala Pro Ala Asp Arg Val Gly 
100 105 110 

eat caa gea get cca ggc ccg cgt get ggc tet aga gcg eta ect eea 384 
His Gin Ala Ala Pro Gly Pro Arg Ala Gly Ser Arg Ala Leu Pro Pro 

115 120 ' • 125 

gga cca gcg ctt cct ggg cat gtg ggg ctg etc egg caa get gat ctg 432 
Gly Pro Ala Leu Pro Gly His Val Gly Leu Leu Arg Gin Ala Asp Leu 
130 135 140 

cac cac ggc cgt gcc ctg gaa cgc cag ctg gag caa caa gaa cct gag 480 
His His Gly Arg Ala Leu Glu Arg Gin Leu Glu Gin Gin Glu Pro Glu 
145 150 155 160 

cca gat ttg gga caa eat gac ctg gat gga gtg gga gcg cga gat cag 528 
Pro Asp Leu Gly Gin His Asp Leu Asp Gly Val Gly Ala Arg Asp Gin 
165 170 175 



caa eta cac cga gat eat eta cag cct gat cga gga gag eea gaa cca 
Gin Leu His Arg Asp His Leu Gin Pro Asp Arg Gly Glu Pro Glu Pro 
180 185 190 



576 
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gca gga gaa gaa cga get gga cct get cca get gga caa gtg ggc aag 624 

Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro Ala Gly Gin Val Gly Lys 
195 200 205 

ctt gtg gaa ctg gtt caa cat cac caa ctg get gtg gta cat caa gat 672 

Leu Val Glu Leu Val Gin His His Gin Leu Ala Val Val His Gin Asp 
210 215 220 

ttt cat cat gat cgt ggg egg cct gat egg cct gcg cat cgt gtt cac 720 

Phe His His Asp Arg Gly Arg Pro Asp Arg Pro Ala His Arg Val His 

225 230 235 240 

cgt get gag cat cgt gaa eeg cgt gcg^'cea ggg eta eag cec cct gag 7 68 

Arg Ala Glu His Arg Glu Pro Arg Ala Pro Gly Leu Gin Pro Pro Glu 
245 250 255 

ctt cca gae eeg eet gee cgt gee eeg egg 7 98 

Leu Pro Asp Pro Pro Ala Arg Ala Pro Arg 
260 265 



<210> 62 
<211> 266 
<212> PRT 

<213> Artificial Sequence 



<400> 62 



Leu 


Glu 


Gin 


Arg 


Gin 


Gly 


Asp 


Phe 


Pro 


Pro 


Arg 


Arg 


Arg 


Arg 


His 


Ala 


1 








5 










10 










15 




Arg 


Gin 


Leu 


Ala 


Gin 


Arg 


Ala 


Val 


Gin 


Val 


Gin 


Gly 


Gly 


Glu 


Asp 


Arg 








20 










25 










30 






Ala 


Pro 


Gly 


His 


Arg 


Pro 


His 


Gin 


Gly 


Gin 


Ala 


Pro 


Arg 


Gly 


Ala 


Ala 






35 










40 










45 








Arg 


Glu 


Ala 


Arg 


Arg 


Gly 


His 


Arg 


Arg 


Tyr 


Val 


Pro 


Arg 


Leu 


Pro 


Gly 




50 










55 










60 










Arg 


Cys 


Arg 


Gin 


His 


His 


Gly 


Arg 


Arg 


Gin 


Pro 


Asp 


Pro 


Asp 


Arg 


Ala 


65 










70 










75 










80 


Gly 


Pro 


Pro 


Ala 


Ala 


Glu 


Arg 


His 


Arg 


Ala 


Ala 


Ala 


Glu 


Gin 


Pro 


Ala 










85 










90 










95 




Ala 


Arg 


His 


Arg 


Gly 


Pro 


Ala 


Ala 


Pro 


Ala 


Pro 


Ala 


Asp 


Arg 


Val 


Gly 








100 










105 










110 






His 


Gin 


Ala 


Ala 


Pro 


Gly 


Pro 


Arg 


Ala 


Gly 


Ser 


Arg 


Ala 


Leu 


Pro 


Pro 






115 










120 










125 








Gly 


Pro 


Ala 


Leu 


Pro 


Gly 


His 


Val 


Gly 


Leu 


Leu 


Arg 


Gin 


Ala 


Asp 


Leu 




130 










135 










140 










His 


His 


Gly 


Arg 


Ala 


Leu 


Glu 


Arg 


Gin 


Leu 


Glu 


Gin 


Gin 


Glu 


Pro 


Glu 


145 










150 










155 










160 


Pro 


Asp 


Leu 


Gly 


Gin 


His 


Asp 


Leu 


Asp 


Gly 


Val 


Gly 


Ala 


Arg 


Asp 


Gin 










165 










170 










175 




Gin 


Leu 


His 


Arg 


Asp 


His 


Leu 


Gin 


Pro 


Asp 


Arg 


Gly 


Glu 


Pro 


Glu 


Pro 








180 










185 










190 






Ala 


Gly 


Glu 


Glu 


Arg 


Ala 


Gly 


Pro 


Ala 


Pro 


Ala 


Gly 


Gin 


Val 


Gly 


Lys 






195 










200 










205 








Leu 


Val 


Glu 


Leu 


Val 


Gin 


His 


His 


Gin 


Leu 


Ala 


Val 


Val 


His 


Gin 


Asp 




210 










215 










220 










Phe 


His 


His 


Asp 


Arg 


Gly 


Arg 


Pro 


Asp 


Arg 


Pro 


Ala 


His 


Arg 


Val 


His 


225 










230 










235 










240 


Arg 


Ala 


Glu 


His 


Arg 


Glu 


Pro 


Arg 


Ala 


Pro 


Gly 


Leu 


Gin 


Pro 


Pro 


Glu 










245 










250 










255 
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Leu Pro Asp Pro Pro Ala Arg Ala Pro Arg 
260 265 

<210> 63 
<211> 800 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> {!)... (800) 

<400> 63 

etc gag cag egg caa gga gat ttt ccg cec egg egg egg cga cat gcg 4 8 

Leu Glu Gin Arg Gin Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 
15 10 15 

cga caa ctg gcg cag cga get gta caa gta caa ggt ggt gaa gat cga 96 
Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 
20 25 30 

gee cct ggg eat cgc cec cae caa ggc caa gcg ccg cgt ggt gca gcg 14 4 

Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 
35 40 45 

cga gaa gcg cgc cgt ggg eat egg cgc tat gtt eet egg ctt cct ggg 192 
Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 
50 55 60 

cgc tgc agg cag cac cat ggg cgc cgc cag cct gae cct gae cgt gea 240 
Arg Cys Arg Gin His His Gly Arg Arg Gin Pro Asp Pro Asp Arg Ala 
65 70 75 80 

ggc ccg eca get get gag egg cat cgt gca gca gca gaa caa cct get 288 
Gly Pro Pro Ala Ala Glu Arg His Arg Ala Ala Ala Glu Gin Pro Ala 
85 90 95 

gcg cgc cat cga ggc cca gca gca cct get cca get gae cgt gtg ggg 336 
Ala Arg His Arg Gly Pro Ala Ala Pro Ala Pro Ala Asp Arg Val Gly 
100 105 110 

eat caa gca get cca ggc ccg cgt get ggc tct aga gcg eta cct eca 38 4 

His Gin Ala Ala Pro Gly Pro Arg Ala Gly Ser Arg Ala Leu Pro Pro 
115 120 125 

gga eca gcg ctt cct ggg cat gtg ggg ctg etc egg caa get gat ctg 432 
Gly Pro Ala Leu Pro Gly His Val Gly Leu Leu Arg Gin Ala Asp Leu 
130 135 140 

cac cac ggc cgt gcc ctg gaa cgc cag ctg gag caa caa gaa cct gag 480 
His His Gly Arg Ala Leu Glu Arg Gin Leu Glu Gin Gin Glu Pro Glu 
145 150 155 160 

eca gat ttg gga caa eat gae ctg gat gga gtg gga gcg cga gat cag 528 
Pro Asp Leu Gly Gin His Asp Leu Asp Gly Val Gly Ala Arg Asp Gin 
165 170 175 



caa eta cac ega gat eat eta cag eet gat cga gga gag eca gaa cca 
Gin Leu His Arg Asp His Leu Gin Pro Asp Arg Gly Glu Pro Glu Pro 



576 
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180 185 190 

gca gga gaa gaa cga get gga cct get cca get gga caa gtg ggc aag 624 

Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro Ala Gly Gin Val Gly Lys 

195 200 205 

ctt gtg gaa ctg gtt caa cat cac caa ctg get gtg gta cat caa gat 672 

Leu Val Glu Leu Val Gin His His Gin Leu Ala Val Val His Gin Asp 

210 215 220 

ttt cat cat gat cgt ggg egg cct gat egg cct gcg cat cgt gtt cac 720 

Phe His His Asp Arg Gly Arg Pro Asp Arg Pro Ala His Arg Val His 

230 235 240 

cgt get gag cat cgt gaa ccg cgt gcg cca ggg atg cag cec cct gag 7 68 

Arg Ala Glu His Arg Glu Pro Arg Ala Pro Gly Met Gin Pro Pro Glu 

245 , 250 255 

ctt cca gac ccg cct gee cgt gtg acg gat cc 800 

Leu Pro Asp Pro Pro Ala Arg Val Thr Asp 
260 265 



<210> 64 
<211> 266 
<212> PRT 

<213> Artificial Sequence 



<400> 64 



Leu 


Glu 


Gin 


Arg 


Gin 


Gly 


Asp 


Phe 


1 








5 








Arg 


Gin 


Leu 


Ala 


Gin 


Arg 


Ala 


Val 








20 










Ala 


Pro 


Gly 


His 


Arg 


Pro 


His 


Gin 






35 










40 


Arg 


Glu 


Ala 


Arg 


Arg 


Gly 


His 


Arg 




50 










55 




Arg 


Cys 


Arg 


Gin 


His 


His 


Gly 


Arg 


65 










70 






Gly 


Pro 


Pro 


Ala 


Ala 


Glu 


Arg 


His 










85 








Ala 


Arg 


His 


Arg 


Gly 


Pro 


Ala 


Ala 








100 










His 


Gin 


Ala 


Ala 


Pro 


Gly 


Pro 


Arg 






115 










120 


Gly 


Pro 


Ala 


Leu 


Pro 


Gly 


His 


Val 




130 










135 




His 


His 


Gly 


Arg 


Ala 


Leu 


Glu 


Arg 


145 










150 






Pro 


Asp 


Leu 


Gly 


Gin 


His 


Asp 


Leu 










165 








Gin 


Leu 


His 


Arg 


Asp 


His 


Leu 


Gin 








180 










Ala 


Gly 


Glu 


Glu 


Arg 


Ala 


Gly 


Pro 






195 










200 


Leu 


Val 


Glu 


Leu 


Val 


Gin 


His 


His 




210 










215 




Phe 


His 


His 


Asp 


Arg 


Gly 


Arg 


Pro 


225 










230 







Pro 


Pro 


Arg 


Arg 


Arg 


Arg 


His 


Ala 




10 










15 




Gin 


Val 


Gin 


Gly 


Gly 


Glu 


Asp 


Arg 


25 










30 






Gly 


Gin 


Ala 


Pro 


Arg 


Gly 


Ala 


Ala 










45 








Arg 


Tyr 


Val 


Pro 


Arg 


Leu 


Pro 


Gly 








60 










Arg 


Gin 


Pro 


Asp 


Pro 


Asp 


Arg 


Ala 






75 










80 


Arg 


Ala 


Ala 


Ala 


Glu 


Gin 


Pro 


Ala 




90 










95 




Pro 


Ala 


Pro 


Ala 


Asp 


Arg 


Val 


Gly 


105 










110 






Ala 


Gly 


Ser 


Arg 


Ala 


Leu 


Pro 


Pro 










125 








Gly 


Leu 


Leu 


Arg 


Gin 


Ala 


Asp 


Leu 








140 










Gin 


Leu 


Glu 


Gin 


Gin 


Glu 


Pro 


Glu 






155 










160 


Asp 


Gly 


Val 


Gly 


Ala 


Arg 


Asp 


Gin 




170 










175 




Pro 


Asp 


Arg 


Gly 


Glu 


Pro 


Glu 


Pro 


185 










190 






Ala 


Pro 


Ala 


Gly 


Gin 


Val 


Gly 


Lys 










205 








Gin 


Leu 


Ala 


Val 


Val 


His 


Gin 


Asp 








220 








Asp 


Arg 


Pro 


Ala 


His 


Arg 


Val 


His 






235 










240 



wo 00/29561 



PCT/DKOO/00144 



35 



Arg Ala Glu His Arg Glu Pro Arg Ala Pro Gly Met Gin Pro Pro Glu 

245 250 255 

Leu Pro Asp Pro Pro Ala Arg Val Thr Asp 
260 265 



<210> 65 
<211> 647 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) ... (647) 



<400> 65 

etc gag cag egg caa gga gat ttt cog ccc egg egg egg ega eat geg 
Leu Glu Gin Arg Gin Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 
5 10 15 



1 



cge tge agg cag cac eat ggg ege ege cag cet gae cet gac egt gca 
Ara Cvs Arq Gin His His Gly Arg Arg Gin Pro Asp Pro Asp Arg Ala 
65 70 75 80 

gge eeg eea get get gag egg cat egt gea gea gca gaa caa cet get 
Gly Pro Pro Ala Ala Glu Arg His Arg Ala Ala Ala Glu Gin Pro Ala 
85 90 95 

geg ege eat ega ggc eea gca gea cet get cca get gac egt gtg ggg 
Ala Arg His Arg Gly Pro Ala Ala Pro Ala Pro Ala Asp Arg Val Gly 
100 105 • 110 

eat caa gca get cca ggc ccg egt get ggc tct aga geg eta cet cca 
His Gin Ala Ala Pro Gly Pro Arg Ala Gly Ser Arg Ala Leu Pro Pro 
115 120 125 



gga cca geg ett cet ggg cat gtg ggg ctg etc egg caa get gat etg 
Gly Pro Ala Leu Pro Gly His Val Gly Leu Leu Arg Gin Ala Asp Leu 
130 135 140 



cac cac ggc egt gee etg gaa ege cag etg gag caa caa gaa ect gag 
His His Gly Arg Ala Leu Glu Arg Gin Leu Glu Gin Gin Glu Pro Glu 
145 150 155 160 

eea gat ttg gga caa cat gae ctg gat gga gtg gga geg ega gat cag 
Pro Asp Leu Gly Gin His Asp Leu Asp Gly Val Gly Ala Arg Asp Gin 
165 170 175 



48 



96 



ega caa ctg geg cag ega get gta caa gta caa ggt ggt gaa gat ega 

Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 
20 25 30 

gee cet ggg eat ege ccc cac caa gge caa geg eeg egt ggt gca geg 144 

Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 
35 40 45 

ega gaa geg ege egt ggg cat egg ege tat gtt cet egg ett cet ggg 192 

Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 
50 55 60 



240 



288 



336 



384 



432 



480 



528 
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caa eta cac cga gat cat eta cag cct gat cga gga gag cca gaa cca 576 

Gin Leu His Arg Asp His Leu Gin Pro Asp Arg Gly Glu Pro Glu Pro 

180 185 190 



gca gga gaa gaa cga get gga cct get cca get gga caa gtg ggc aag 624 
Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro Ala Gly Gin Val Gly Lys 
195 200 205 



ett gtg tga ctg att gag gat cc 
Leu Val * Leu lie Glu Asp 
210 



647 



<210> 66 
<211> 214 
<212> PRT 

<213> Artificial Sequence 



<400> 66 
Leu Glu Gin Arg 
1 

Arg Gin Leu Ala 
20 

Ala Pro Gly His 
35 

Arg Glu Ala Arg 
50 

Arg Cys Arg Gin 
65 

Gly Pro Pro Ala 

Ala Arg His Arg 
100 

His Gin Ala Ala 
115 

Gly Pro Ala Leu 
130 

His His Gly Arg 
145 

Pro Asp Leu Gly 

Gin Leu His Arg 
180 

Ala Gly Glu Glu 
195 

Leu Val Leu lie 

210 



Gin 


Gly 


Asp 


Phe 


5 








Gin 


Arg 


Ala 


Val 


Arg 


Pro 


His 


Gin 








40 


Arg 


Gly 


His 


Arg 






55 




His 


His 


Gly 


Arg 




70 






Ala 


Glu 


Arg 


His 


85 








Gly 


Pro 


Ala 


Ala 


Pro 


Gly 


Pro 


Arg 








120 


Pro 


Gly 


His 


Val 






135 




Ala 


Leu 


Glu 


Arg 




150 






Gin 


His 


Asp 


Leu 


165 








Asp 


His 


Leu 


Gin 


Arg 


Ala 


Gly 


Pro 








200 


Glu 


Asp 







Pro Pro Arg Arg 

10 

Gin Val Gin Gly 
25 

Gly Gin Ala Pro 

Arg Tyr Val Pro 
60 

Arg Gin Pro Asp 
75 

Arg Ala Ala Ala 

90 

Pro Ala Pro Ala 
105 

Ala Gly Ser Arg 

Gly Leu Leu Arg 

140 

Gin Leu Glu Gin 
155 

Asp Gly Val Gly 
170 

Pro Asp Arg Gly 
185 

Ala Pro Ala Gly 



Arg Arg His Ala 
15 

Gly Glu Asp Arg 
30 

Arg Gly Ala Ala 
45 

Arg Leu Pro Gly 

Pro Asp Arg Ala 
80 

Glu Gin Pro Ala 

95 

Asp Arg Val Gly 
110 

Ala Leu Pro Pro 
125 

Gin Ala Asp Leu 

Gin Glu Pro Glu 
160 

Ala Arg Asp Gin 
175 

Glu Pro Glu Pro 

190 

Gin Val Gly Lys 
205 



<210> 67 
<211> 1918 
<212> DNA 

<213> Artificial Sequence 



<220> 

<221> CDS 

<222> (1) . . . (1918) 



<400> 67 

get age gcg gee gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ecc 



48 
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Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
15 10 15 



gtg tgg aag gac gcc acc acc acc ctg ttc tgc gcc age gac gcc aag 
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
20 25 30 



96 



gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 40 45 



144 



ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 
-PlTO Thr Asp Pro Asn Pro Gin Glu Valr-Val Leu Gly Asn Val Thr Glu 
50 55 60 



192 



aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 



240 



ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg acc 
lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 

85 90 95 



288 



ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac age acc gac 
Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 
100 105 110 



336 



acc aac aac acc ego tgg ggc acc cag gag atg aag aac tgc age ttc 
Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 120 125 



384 



aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gcc ctg 
Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 
130 135 140 



432 



ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tac 
Phe Tyr Ser Leu Asp lie Val Pro lie Asp Asn Asp Asn Thr Ser Tyr 
145 150 155 160 



480 



cgc ctg cgc age tgc aac aca teg ate ate acc cag gee tgc ccc aag 
Arg Leu Arg Ser Cys Asn Thr Ser lie lie Thr Gin Ala Cys Pro Lys 
165 170 175 



528 



gtg age ttc gag ccc ate ccc ate cac ttc tgc gcc ccc gee ggc ttc 
Val Ser Phe Glu Pro lie Pro lie His Phe Cys Ala Pro Ala Gly Phe 
180 185 190 



576 



gcc ate ctg aag tgc aac aac aag ace ttc aac ggc ace ggc ccc tgc 
Ala lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 
195 200 205 



624 



acc aac gtg age acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 
Thr Asn Val Ser Thr Val Gin Cys Thr His Gly lie Arg Pro Val Val 

210 215 220 



672 



age acc cag ctg ctg ctg aac ggc age ctg gcc gag gag gag gtg gtg 
Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
225 230 235 240 



720 



wo 00/29561 PCT/DKOO/001 44 

38 

ate aga tct gag aac ttc acc aac aac gcc aag acc ate ate gtg cag 768 

lie Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr lie lie Val Gin 
245 250 255 

ctg aac gag age gtg gag ate aae tge acc egc cec aac aac aac acc 816 
Leu Asn Glu Ser Val Glu lie Asn Cys Thr Arg Pro Asn Asn Asn Thr 
260 265 270 

egc aag age ate eac ate ggc cct gge egc gcc ttc tac acc acc ggc 864 
Arg Lys Ser lie His He Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 
275 280 285 

-Ofsro ate ate ggc gac ate egc cag gcir-can tgc aac ate tct aga acc 912 
Asp He He Gly Asp He Arg Gin Ala His Cys Asn He Ser Arg Thr 
290 295 300 

aac tgg acc aac ace ctg aag egc gtg gcc gag aag ctg egc gag aag 960 
Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys 
305 310 315 320 

ttc aac aac acc acc ate gtg ttc aac cag age tec ggc ggc gac ccc 1008 
Phe Asn Asn Thr Thr He Val Phe Asn Gin Ser Ser Gly Gly Asp Pro 
325 330 335 

gag ate gtg atg cae age ttc aac tgc ggc gge gag ttc ttc tac tgc 1056 
Glu He Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 
340 345 350 

aac acc acc cag ctg ttc aac age acc tgg aac gag acc aac age gag 1104 
Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu 
355 360 365 

ggc aac ate act agt ggc acc ate acc ctg ccc tgc egc ate aag cag 1152 
Gly Asn He Thr Ser Gly Thr He Thr Leu Pro Cys Arg He Lys Gin 
370 375 380 

ate ate aac atg tgg cag gag gtg ggc aag gcc atg tac gcc ccc ccc 1200 
He He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
385 390 395 400 

ate ggc ggc cag ate aag tgc ctg age aac ate ace ggc ctg ctg ctg 1248 
He Gly Gly Gin He Lys Cys Leu Ser Asn He Thr Gly Leu Leu Leu 
405 410 415 

acc egc gac ggc ggc age gac aac teg age age ggc aag gag att ttc 1296 
Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu He Phe 
420 425 430 

egc ccc ggc ggc gge gac atg egc gac aac tgg egc age gag ctg tac 134 4 

Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr 
435 440 445 

aag tac aag gtg gtg aag ate gag ccc ctg ggc ate gcc ccc acc aag 1392 
Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly He Ala Pro Thr Lys 
450 455 460 



gcc aag egc egc gtg gtg cag egc gag aag egc gcc gtg ggc ate ggc 
Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He Gly 
465 470 475 480 



1440 
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get atg ttc etc ggc ttc ctg ggc get gca gge age acc atg ggc gee 1488 
Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 
485 490 495 

gee age ctg acc ctg acc gtg cag gcc cgc cag ctg ctg age ggc ate 1536 
Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Leu Ser Gly lie 
500 505 510 

gtg cag cag cag aac aac ctg ctg cgc gee ate gag gcc cag cag eac 1584 
Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Ala Gin Gin His 
515 520 525 

ctg etc cag ctg acc gtg tgg ggc ate aag cag etc cag gcc cgc gtg 1632 
Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg Val 
530 535 540 

ctg get eta gag egc tac etc cag gae eag cgc ttc ctg gge atg tgg 1680 
Leu Ala Leu Glu Arg Tyr Leu Gin Asp Gin Arg Phe Leu Gly Met Trp 
545 550 555 560 

gge tge tec ggc aag ctg ate tge acc aeg gcc gtg ecc tgg aac gcc 1728 
Gly Cys Ser Gly Lys Leu lie Cys Thr Thr Ala Val Pro Trp Asn Ala 
565 570 575 

age tgg age aac aag aac ctg age cag att tgg gae aac atg acc tgg 1776 
Ser Trp Ser Asn Lys Asn Leu Ser Gin lie Trp Asp Asn Met Thr Trp 
580 585 590 

atg gag tgg gag cgc gag ate age aae tac aee gag ate ate tac age 1824 
Met Glu Trp Glu Arg Glu lie Ser Asn Tyr Thr Glu lie lie Tyr Ser 

595 600 605 

ctg ate gag gag age eag aae eag cag gag aag aae gag ctg gae etg 1872 
Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Leu Asp Leu 
610 615 620 

etc cag ctg gae aag tgg gca age ttg tgt gae tga ttg agg ate e 1918 
Leu Gin Leu Asp Lys Trp Ala Ser Leu Cys Asp * Leu Arg lie 
625 630 635 



<210> 68 
<211> 638 
<212> PRT 

<213> Artificial Sequence 

<400> 68 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 

15 10 15 

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 

20 25 30 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 

35 40 45 

Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 

50 55 60 

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 

lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
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Pro Leu Cys Val 
100 

Thr Asn Asn Thr 
115 

Asn lie Ser Thr 

130 

Phe Tyr Ser Leu 
145 

Arg Leu Arg Ser 

Val Ser Phe Glu 
180 

Ala lie Leu Lys 
195 

Thr Asn Val Ser 

210 

Ser Thr Gin Leu 
225 

lie Arg Ser Glu 

Leu Asn Glu Ser 
260 

Arg Lys Ser lie 
275 

Asp lie lie Gly 
290 

Asn Trp Thr Asn 

305 

Phe Asn Asn Thr 

Glu He Val Met 
340 

Asn Thr Thr Gin 

355 

Gly Asn He Thr 
370 

He He Asn Met 
385 

He Gly Gly Gin 

Thr Arg Asp Gly 
420 

Arg Pro Gly Gly 

435 

Lys Tyr Lys Val 
450 

Ala Lys Arg Arg 

465 

Ala Met Phe Leu 

Ala Ser Leu Thr 
500 

Val Gin Gin Gin 
515 

Leu Leu Gin Leu 
530 

Leu Ala Leu Glu 
545 



85 

Thr Leu Asn Cys 

Arg Trp Gly Thr 
120 

Ser Val Arg Asn 

135 

Asp He Val Pro 
150 

Cys Asn Thr Ser 
165 

Pro He Pro He 

Cys Asn Asn Lys 
200 

Thr Val Gin Cys 

215 

Leu Leu Asn Gly 
230 

Asn Phe Thr Asn 
245 

Val Glu He Asn 

His He Gly Pro 
280 

Asp He Arg Gin 
295 

Thr Leu Lys Arg 
310 

Thr He Val Phe 
325 

His Ser Phe Asn 

Leu Phe Asn Ser 
360 

Ser Gly Thr He 
375 

Trp Gin Glu Val 

390 

He Lys Cys Leu 
405 

Gly Ser Asp Asn 

Gly Asp Met Arg 
440 

Val Lys He Glu 
455 

Val Val Gin Arg 
470 

Gly Phe Leu Gly 

485 

Leu Thr Val Gin 

Asn Asn Leu Leu 
520 

Thr Val Trp Gly 

535 

Arg Tyr Leu Gin 
550 



90 

Thr Lys Leu Lys 
105 

Gin Glu Met Lys 

Lys Met Lys Arg 

140 

He Asp Asn Asp 
155 

He He Thr Gin 
170 

His Phe Cys Ala 

18"5"'~ 

Thr Phe Asn Gly 

Thr His Gly He 
220 

Ser Leu Ala Glu 

235 

Asn Ala Lys Thr 
250 

Cys Thr Arg Pro 
265 

Gly Arg Ala Phe 

Ala His Cys Asn 
300 

Val Ala Glu Lys 

315 

Asn Gin Ser Ser 
330 

Cys Gly Gly Glu 
345 

Thr Trp Asn Glu 

Thr Leu Pro Cys 
380 

Gly Lys Ala Met 
395 

Ser Asn He Thr 
410 

Ser Ser Ser Gly 
425 

Asp Asn Trp Arg 

Pro Leu Gly He 
460 

Glu Lys Arg Ala 
475 

Ala Ala Gly Ser 

490 

Ala Arg Gin Leu 
505 

Arg Ala He Glu 

He Lys Gin Leu 
540 

Asp Gin Arg Phe 
555 



95 

Asn Ser Thr Asp 
110 

Asn Cys Ser Phe 
125 

Glu Tyr Ala Leu 

Asn Thr Ser Tyr 
160 

Ala Cys Pro Lys 
175 

Pro Ala Gly Phe 
190 

Thr Gly Pro Cys 

205 

Arg Pro Val Val 

Glu Glu Val Val 

240 

He He Val Gin 
255 

Asn Asn Asn Thr 
270 

Tyr Thr Thr Gly 

285 

He Ser Arg Thr 

Leu Arg Glu Lys 

320 

Gly Gly Asp Pro 
335 

Phe Phe Tyr Cys 
350 

Thr Asn Ser Glu 
365 

Arg He Lys Gin 

Tyr Ala Pro Pro 
400 

Gly Leu Leu Leu 

415 

Lys Glu He Phe 
430 

Ser Glu Leu Tyr 
445 

Ala Pro Thr Lys 

Val Gly He Gly 
480 

Thr Met Gly Ala 

495 

Leu Ser Gly He 
510 

Ala Gin Gin His 
525 

Gin Ala Arg Val 

Leu Gly Met Trp 
560 
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Gly Cys Ser Gly Lys Leu lie Cys Thr Thr Ala Val Pro Trp Asn Ala 

565 570 575 

Ser Trp Ser Asn Lys Asn Leu Ser Gin lie Trp Asp Asn Met Thr Trp 

580 585 590 

Met Glu Trp Glu Arg Glu lie Ser Asn Tyr Thr Glu lie lie Tyr Ser 

595 600 605 

Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Leu Asp Leu 

610 615 620 

Leu Gin Leu Asp Lys Trp Ala Ser Leu Cys Asp Leu Arg lie 
625 630 635 

<210> 69 
<211> 2071 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (2071) 

<400> 69 

get age gcg gee gac cge ctg tgg gtg acc gtg tac tac ggc gtg ccc 4 8 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
15 10 15 

gtg tgg aag gac gcc acc acc acc ctg ttc tgc gee age gac gee aag 96 
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
20 25 30 

gcc tac gac ace gag gtg cac aac gtg tgg gee acc cac gcg tgc gtg 144 
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 40 45 

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192 
Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 
50 55 60 

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 24 0 

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 

ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg acc 288 
lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
85 90 95 

ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac age acc gac 336 
Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 
100 105 110 

acc aac aac ace cge tgg ggc acc cag gag atg aag aac tgc age ttc 384 
Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 120 125 

aac ate age acc age gtg cge aac aag atg aag cge gag tac gee ctg 4 32 

Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 
130 135 140 



ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tac 
Phe Tyr Ser Leu Asp lie Val Pro lie Asp Asn Asp Asn Thr Ser Tyr 



480 
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145 150 155 160 

cgc ctg cgc age tgc aac aca teg ate ate aee cag gee tge eee aag 528 
Arg Leu Arg Ser Cys Asn Thr Ser lie lie Thr Gin Ala Cys Pro Lys 
165 170 175 

gtg age ttc gag eee ate eee ate eac ttc tge gee eee gee ggc tte 576 
Val Ser Phe Glu Pro lie Pro lie His Phe Cys Ala Pro Ala Gly Phe 

180 185 190 

gcc ate ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 624 
Ala lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 

195 200 205 

acc aac gtg age acc gtg cag tge acc cae gga att cgc eee gtg gtg 672 

Thr Asn Val Ser Thr Val Gin Cys Thr His Gly lie Arg Pro Val Val 
210 215 220 

age ace cag ctg ctg ctg aac ggc age ctg gee gag gag gag gtg gtg 720 

Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
225 230 235 240 

ate aga tct gag aac ttc acc aac aac gee aag acc ate ate gtg cag 768 

lie Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr lie lie Val Gin 
245 250 255 

ctg aac gag age gtg gag ate aac tgc acc cgc ccc aac aac aac acc 816 

Leu Asn Glu Ser Val Glu lie Asn Cys Thr Arg Pro Asn Asn Asn Thr 

260 265 270 

cgc aag age ate cae ate ggc cet ggc cgc gee tte tae acc acc ggc 864 

Arg Lys Ser lie His lie Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 
275 280 285 

gae ate ate ggc gac ate cgc cag gee cae tgc aac ate tct aga acc 912 

Asp lie lie Gly Asp lie Arg Gin Ala His Cys Asn lie Ser Arg Thr 
290 295 300 

aac tgg aee aac acc ctg aag cgc gtg gcc gag aag ctg cgc gag aag 960 

Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys 
305 310 315 320 

ttc aac aac acc acc ate gtg ttc aac cag age tec ggc ggc gac ccc 1008 

Phe Asn Asn Thr Thr lie Val Phe Asn Gin Ser Ser Gly Gly Asp Pro 

325 330 335 

gag ate gtg atg cae age ttc aac tgc ggc ggc gag ttc ttc tae tgc 1056 

Glu lie Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 

340 345 350 

aac acc ace cag ctg ttc aac age acc tgg aac gag acc aac age gag 1104 

Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu 
355 360 365 

ggc aac ate act agt ggc ace ate acc ctg ccc tgc cgc ate aag cag 1152 

Gly Asn lie Thr Ser Gly Thr lie Thr Leu Pro Cys Arg lie Lys Gin 
370 375 380 



ate ate aac atg tgg cag gag gtg ggc aag gcc atg tae gcc ccc ccc 



1200 
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lie lie Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
385 390 395 400 

ate ggc ggc cag ate aag tgc ctg age aae ate aee gge etg etg etg 1248 

lie Gly Gly Gin lie Lys Cys Leu Ser Asn lie Thr Gly Leu Leu Leu 
405 410 415 

acc cge gac gge gge age gac aac teg age age gge aag gag att ttc 1296 

Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu lie Phe 
420 425 430 

cgc ccc ggc gge gge gae atg cge gac aac tgg ege age gag ctg tac 1344 

7^g Pro Gly Gly Gly Asp Met Arg AslTT^sh Trp Arg Ser Glu Leu Tyr 
435 440 445 

aag tac aag gtg gtg aag ate gag eee etg gge ate gee eee aee aag 1392 

Lys Tyr Lys Val Val Lys lie Glu Pro Leu Gly lie Ala Pro Thr Lys 

450 455 460 

gee aag ege ege gtg gtg cag cge gag aag ege gee gtg gge ate gge 144 0 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly lie Gly 
465 470 475 480 

get atg tte etc gge ttc ctg ggc get gea gge age aee atg gge gee 1488 

Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 
485 490 495 

gcc age ctg acc ctg acc gtg cag gcc cgc cag ctg ctg age ggc ate 1536 

Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Leu Ser Gly lie 
500 505 510 

gtg cag cag cag aae aac ctg ctg cgc gee ate gag gee cag cag eac 1584 

Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Ala Gin Gin His 
515 520 525 

etg etc eag etg aee gtg tgg gge ate aag eag etc eag gcc ege gtg 1632 

Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg Val 

530 535 540 

ctg get eta gag ege tac etc cag gae eag ege tte ctg gge atg tgg 1680 

Leu Ala Leu Glu Arg Tyr Leu Gin Asp Gin Arg Phe Leu Gly Met Trp 
545 550 • 555 560 

ggc tgc tec ggc aag ctg ate tgc acc acg gcc gtg ccc tgg aac gcc 1728 

Gly Cys Ser Gly Lys Leu lie Cys Thr Thr Ala Val Pro Trp Asn Ala 
565 570 575 

age tgg age aae aag aae etg age eag att tgg gac aac atg aee tgg 1776 

Ser Trp Ser Asn Lys Asn Leu Ser Gin lie Trp Asp Asn Met Thr Trp 
580 585 590 

atg gag tgg gag cgc gag ate age aae tac acc gag ate ate tac age 1824 

Met Glu Trp Glu Arg Glu lie Ser Asn Tyr Thr Glu lie lie Tyr Ser 
595 600 605 



ctg ate gag gag age eag aae cag cag gag aag aae gag etg gae ctg 
Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Leu Asp Leu 
610 615 620 



1872 
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etc cag ctg gac aag tgg gca age ttg tgg aac tgg ttc aac ate aee 1920 

Leu Gin Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn lie Thr 
625 630 635 640 

aae tgg etg tgg tac ate aag att ttc ate atg ate gtg ggc gge ctg 1968 
Asn Trp Leu Trp Tyr lie Lys lie Phe lie Met lie Val Gly Gly Leu 
645 650 655 

ate ggc ctg cgc ate gtg ttc acc gtg ctg age ate gtg aae cgc gtg 2016 
lie Gly Leu Arg lie Val Phe Thr Val Leu Ser lie Val Asn Arg Val 
660 665 670 

cgc cag gga tge age cec etg age ttc cag acc cgc ctg cec gtg tga 2064 
Arg Gin Gly Cys Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Val * 
675 680 685 

egg ate e 2071 
Arg lie 



<210> 70 
<211> 689 
<212> PRT 

<213> Artificial Sequence 



<400> 70 
Ala Ser Ala Ala 
1 

Val Trp Lys Asp 
20 

Ala Tyr Asp Thr 
35 

Pro Thr Asp Pro 
50 

Asn Phe Asn Met 
65 

lie lie Ser Leu 

Pro Leu Cys Val 
100 

Thr Asn Asn Thr 
115 

Asn lie Ser Thr 
130 

Phe Tyr Ser Leu 
145 

Arg Leu Arg Ser 

Val Ser Phe Glu 

180 

Ala lie Leu Lys 
195 

Thr Asn Val Ser 
210 

Ser Thr Gin Leu 
225 

lie Arg Ser Glu 



Asp Arg Leu Trp 

5 

Ala Thr Thr Thr 

Glu Val His Asn 
40 

Asn Pro Gin Glu 
55 

Gly Lys Asn Asn 
70 

Trp Asp Gin Ser 

85 

Thr Leu Asn Cys 

Arg Trp Gly Thr 
120 

Ser Val Arg Asn 

135 

Asp lie Val Pro 
150 

Cys Asn Thr Ser 
165 

Pro lie Pro lie 

Cys Asn Asn Lys 
200 

Thr Val Gin Cys 
215 

Leu Leu Asn Gly 
230 

Asn Phe Thr Asn 
245 



Val Thr Val Tyr 

10 

Leu Phe Cys Ala 
25 

Val Trp Ala Thr 

Val Val Leu Gly 
60 

Met Val Glu Gin 
75 

Leu Lys Pro Cys 

90 

Thr Lys Leu Lys 
105 

Gin Glu Met Lys 

Lys Met Lys Arg 
140 

lie Asp Asn Asp 
155 

He He Thr Gin 

170 

His Phe Cys Ala 

185 

Thr Phe Asn Gly 

Thr His Gly He 
220 

Ser Leu Ala Glu 
235 

Asn Ala Lys Thr 
250 



Tyr Gly Val Pro 
15 

Ser Asp Ala Lys 
30 

His Ala Cys Val 
45 

Asn Val Thr Glu 

Met His Glu Asp 
80 

Val Lys Leu Thr 

95 

Asn Ser Thr Asp 
110 

Asn Cys Ser Phe 
125 

Glu Tyr Ala Leu 

Asn Thr Ser Tyr 
160 

Ala Cys Pro Lys 

175 

Pro Ala Gly Phe 
190 

Thr Gly Pro Cys 
205 

Arg Pro Val Val 

Glu Glu Val Val 
240 

He He Val Gin 
255 
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Leu Asn Glu Ser 
260 

Arg Lys Ser lie 

275 

Asp lie lie Gly 
290 

Asn Trp Thr Asn 
305 

Phe Asn Asn Thr 

Glu lie Val Met 
340 

Thr Thr Gin 

355 

Gly Asn lie Thr 
370 

lie lie Asn Met 
385 

He Gly Gly Gin 

Thr Arg Asp Gly 
420 

Arg Pro Gly Gly 

435 

Lys Tyr Lys Val 
450 

Ala Lys Arg Arg 
465 

Ala Met Phe Leu 

Ala Ser Leu Thr 
500 

Val Gin Gin Gin 
515 

Leu Leu Gin Leu 

530 

Leu Ala Leu Glu 
545 

Gly Cys Ser Gly 

Ser Trp Ser Asn 
580 

Met Glu Trp Glu 
595 

Leu He Glu Glu 
610 

Leu Gin Leu Asp 
625 

Asn Trp Leu Trp 

He Gly Leu Arg 
660 

Arg Gin Gly Cys 
675 

He 



Val Glu He Asn 

His He Gly Pro 
280 

Asp He Arg Gin 
295 

Thr Leu Lys Arg 
310 

Thr He Val Phe 

325 

His Ser Phe Asn 

Leu Phe Asn Ser 

360 

Ser Gly Thr He 
375 

Trp Gin Glu Val 
390 

He Lys Cys Leu 
405 

Gly Ser Asp Asn 

Gly Asp Met Arg 
440 

Val Lys He Glu 
455 

Val Val Gin Arg 
470 

Gly Phe Leu Gly 
485 

Leu Thr Val Gin 

Asn Asn Leu Leu 

520 

Thr Val Trp Gly 

535 

Arg Tyr Leu Gin 
550 

Lys Leu He Cys 
565 

Lys Asn Leu Ser 

Arg Glu He Ser 
600 

Ser Gin Asn Gin 

615 

Lys Trp Ala Ser 
630 

Tyr He Lys He 
645 

He Val Phe Thr 

Ser Pro Leu Ser 
680 



Cys 


1 nr 


Arg 


Pro 


O c: 










Arg 


f\± a 


ir ne 


/\-La 


His 


Cys 


Asn 








300 


Val 


Ala 


Glu 


Lys 






315 




Asn 


Gin 


Ser 


o o -r- 




330 






Cys 


Gly 


Gly 


o J.U 


345 








Thr Trp Asn 




Thr 


Leu 


Pro 


Cys 








380 


Gly 


Lys 


Ala 


Met 






395 




Ser 


Asn 


He 


1 nr 




410 






Ser 


Ser 


Ser 


vjxy 


425 








Asp 


Asn 


J. j.p 


Arg 


Pro 


Leu 


Gly 


1 ±e 








460 


Glu 


Lys 


Arg 


Ala 






475 




Ala 


Ala 


Gly 


Ser 




490 






Ala 


Arg 


Gin 


Leu 


505 








Arg 


Ala 


He 


vjXU 


He 


Lys 


Gin 


Leu 








540 


Asp 


Gin 


Arg 


Phe 






555 




Thr 


Thr 


Ala 


vax 




570 






Gin 


He 


Trp 


Asp 


585 








Asn 


Tyr 


Thr 


V3XU 


Gin 


Glu 


Lys 


Asn 








620 


Leu 


Trp 


Asn 


Trp 






635 




Phe 


He 


Met 


He 




650 






Val 


Leu 


Ser 


He 


665 








Phe 


Gin 


Thr 


Arg 



Asn Asn Asn Thr 

270 

Tyr Thr Thr Gly 
285 

He Ser Arg Thr 

Leu Arg Glu Lys 

320 

Gly Gly Asp Pro 
335 

Phe Phe Tyr Cys 
350 

Thr Asn Ser Glu 

365 

Arg He Lys Gin 

Tyr Ala Pro Pro 
400 

Gly Leu Leu Leu 
415 

Lys Glu He Phe 
430 

Ser Glu Leu Tyr 
445 

Ala Pro Thr Lys 

Val Gly He Gly 
480 

Thr Met Gly Ala 
495 

Leu Ser Gly He 
510 

Ala Gin Gin His 

525 

Gin Ala Arg Val 

Leu Gly Met Trp 
560 

Pro Trp Asn Ala 
575 

Asn Met Thr Trp 
590 

He He Tyr Ser 
605 

Glu Leu Asp Leu 

Phe Asn He Thr 
640 

Val Gly Gly Leu 
655 

Val Asn Arg Val 
670 

Leu Pro Val Arg 
685 



<210> 71 
<211> 2469 
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<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (2469) 

<400> 71 

got age gcg gee gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 48 
Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
15 10 15 

VfVg tgg aag gac gcc acc acc acc ct"g"ttc tgc gcc age gac gcc aag 96 
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
20 25 30 

gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 14 4 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 40 45 

ccc acc gac ccc aac cee cag gag gtg gtg ctg ggc aac gtg acc gag 192 
Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 
50 55 60 

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240 
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 

ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg ace 288 
lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
85 90 95 

ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac age acc gac 336 
Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 

100 105 110 

acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc age ttc 384 
Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 120 125 

aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gcc ctg 432 
Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 
130 135 140 

ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tac 480 
Phe Tyr Ser Leu Asp lie Val Pro lie Asp Asn Asp Asn Thr Ser Tyr 
145 150 155 160 

cgc ctg cgc age tgc aac aca teg ate ate acc cag gcc tgc ccc aag 528 
Arg Leu Arg Ser Cys Asn Thr Ser lie lie Thr Gin Ala Cys Pro Lys 
165 170 175 

gtg age ttc gag ccc ate cee ate cac ttc tgc gee ccc gee ggc ttc 576 
Val Ser Phe Glu Pro lie Pro lie His Phe Cys Ala Pro Ala Gly Phe 
180 185 190 



gcc ate ctg aag tgc aac aac aag acc ttc aac ggc ace ggc cee tgc 
Ala lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 
195 200 205 



624 
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acc aac gtg 
Thr Asn Val 
210 

age acc cag 
Ser Thr Gin 
225 

ate aga tct 
lie Arg Ser 



ctg aac gag 
Leu Asn Glu 



cgc aag age 
Arg Lys Ser 

275 

gac ate ate 
Asp lie lie 
290 

aac tgg ace 
Asn Trp Thr 
305 

ttc aac aac 
Phe Asn Asn 



gag ate gtg 
Glu lie Val 



aac acc acc 
Asn Thr Thr 
355 

ggc aac ate 
Gly Asn lie 
370 

ate ate aac 
lie lie Asn 
385 

ate ggc gge 
lie Gly Gly 



ace cgc gac 
Thr Arg Asp 



cgc ccc ggc 
Arg Pro Gly 



age acc gtg 
Ser Thr Val 



ctg ctg ctg 
Leu Leu Leu 
230 

gag aac ttc 
Glu Asn Phe 
245 

age gtg gag 
Ser Val Glu 
260 

ate eae ate 
lie His lie 



gge gac ate 
Gly Asp lie 



aac acc ctg 
Asn Thr Leu 
310 

ace ace ate 
Thr Thr lie 
325 

atg eae age 
Met His Ser 
340 

cag ctg ttc 
Gin Leu Phe 



act agt ggc 
Thr Ser Gly 



atg tgg cag 
Met Trp Gin 
390 

eag ate aag 
Gin lie Lys 

405 

ggc ggc age 
Gly Gly Ser 
420 

ggc ggc gac 
Gly Gly Asp 



cag tge ace 
Gin Cys Thr 
215 

aac ggc age 
Asn Gly Ser 



acc aac aac 
Thr Asn Asn 



ate aac tge 
lie Asn Cys 
265 

gge ect gge 
Gly Pro Gly 

280 

cgc cag gee 
Arg Gin Ala 
295 

aag cgc gtg 
Lys Arg Val 



gtg ttc aac 
Val Phe Asn 



ttc aac tge 
Phe Asn Cys 
345 

aac age acc 
Asn Ser Thr 
360 

acc ate acc 
Thr lie Thr 
375 

gag gtg ggc 
Glu Val Gly 



tge ctg age 
Cys Leu Ser 



gac aac teg 
Asp Asn Ser 
425 

atg cgc gac 
Met Arg Asp 



cac gga att 
His Gly lie 
220 

ctg gee gag 
Leu Ala Glu 
235 

gee aag acc 
Ala Lys Thr 
250 

acc cgc ccc 
Thr Arg Pro 



cgc gee ttc 
Arg Ala Phe 



eae tge aac 
His Cys Asn 
300 

gee gag aag 
Ala Glu Lys 
315 

cag age tec 
Gin Ser Ser 
330 

ggc ggc gag 
Gly Gly Glu 



tgg aac gag 
Trp Asn Glu 



ctg ccc tge 
Leu Pro Cys 
380 

aag gee atg 
Lys Ala Met 
395 

aac ate ace 
Asn lie Thr 
410 

age age gge 
Ser Ser Gly 



aac tgg cgc 
Asn Trp Arg 



cgc ccc gtg 
Arg Pro Val 



gag gag gtg 
Glu Glu Val 



ate ate gtg 
lie He Val 
255 

aac aac aac 
Asn Asn Asn 
270 

tae ace ace 
Tyr Thr Thr 

285 

ate tct aga 
He Ser Arg 



ctg cgc gag 
Leu Arg Glu 



ggc ggc gac 
Gly Gly Asp 
335 

ttc ttc tae 
Phe Phe Tyr 
350 

acc aac age 
Thr Asn Ser 
365 

cgc ate aag 
Arg He Lys 



tae gee ccc 
Tyr Ala Pro 



ggc ctg ctg 
Gly Leu Leu 
415 

aag gag att 
Lys Glu He 
430 

age gag ctg 
Ser Glu Leu 



gtg 672 
Val 



gtg 720 

Val 

240 

cag 768 
Gin 



acc 816 
Thr 



gge 864 
Gly 



ace 912 
Thr 



aag 960 

Lys 

320 

ccc 1008 
Pro 



tge 1056 
Cys 



gag 1104 
Glu 



eag 1152 
Gin 



ccc 1200 

Pro 

400 

ctg 1248 
Leu 



ttc 1296 
Phe 



tac 1344 
Tyr 
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435 

aag tac aag 
Lys Tyr Lys 
450 

gcc aag cgc 
Ala Lys Arg 
465 

get atg ttc 
Ala Met Phe 



gcc age ctg 
Ala Ser Leu 



gtg cag cag 
Val Gin Gin 
515 

ctg etc cag 
Leu Leu Gin 
530 

ctg get eta 
Leu Ala Leu 
545 

ggc tgc tec 
Gly Cys Ser 



age tgg age 
Ser Trp Ser 



atg gag tgg 
Met Glu Trp 
595 

ctg ate gag 
Leu lie Glu 
610 

etc cag ctg 
Leu Gin Leu 
625 

aac tgg ctg 
Asn Trp Leu 



ate ggc ctg 
lie Gly Leu 



cgc cag ggc 



gtg gtg aag 
Val Val Lys 



cgc gtg gtg 
Arg Val Val 
470 

etc gge tte 
Leu Gly Phe 
485 

ace ctg acc 
Thr Leu Thr 

500 

cag aac aac 
Gin Asn Asn 



ctg ace gtg 
Leu Thr Val 



gag cgc tac 
Glu Arg Tyr 
550 

ggc aag ctg 
Gly Lys Leu 
565 

aac aag aac 
Asn Lys Asn 
580 

gag cgc gag 
Glu Arg Glu 



gag age cag 
Glu Ser Gin 



gac aag tgg 
Asp Lys Trp 
630 

tgg tac ate 
Trp Tyr lie 
645 

cgc ate gtg 
Arg He Val 
660 

tac age ccc 



440 

ate gag ccc 
He Glu Pro 
455 

cag cgc gag 
Gin Arg Glu 



ctg gge get 
Leu Gly Ala 



gtg cag gee 
Val Gin Ala 

505 

ctg ctg cgc 
Leu Leu Arg 
520 

tgg gge ate 
Trp Gly He 
535 

etc cag gac 
Leu Gin Asp 



ate tgc acc 
lie Cys Thr 



ctg age cag 
Leu Ser Gin 
585 

ate age aac 
He Ser Asn 
600 

aac cag cag 
Asn Gin Gin 
615 

gca age ttg 
Ala Ser Leu 



aag att tte 
Lys He Phe 



tte acc gtg 
Phe Thr Val 
665 

ctg age tte 



ctg ggc ate 
Leu Gly He 
460 

aag cgc gee 
Lys Arg Ala 
475 

gca gge age 
Ala Gly Ser 
490 

cgc cag ctg 
Arg Gin Leu 



gcc ate gag 
Ala He Glu 



aag cag etc 
Lys Gin Leu 
540 

cag cgc ttc 
Gin Arg Phe 
555 

acg gcc gtg 
Thr Ala Val 
570 

att tgg gac 
He Trp Asp 



tac ace gag 
Tyr Thr Glu 



gag aag aac 
Glu Lys Asn 
620 

tgg aac tgg 
Trp Asn Trp 
635 

ate atg ate 
He Met He 
650 

ctg age ate 
Leu Ser He 



cag ace cgc 



445 

gcc ccc acc 
Ala Pro Thr 



gtg ggc ate 
Val Gly He 



ace atg gge 
Thr Met Gly 
495 

ctg age gge 
Leu Ser Gly 
510 

gcc cag cag 
Ala Gin Gin 
525 

cag gee cgc 
Gin Ala Arg 



ctg ggc atg 
Leu Gly Met 



ccc tgg aac 
Pro Trp Asn 
575 

aac atg acc 
Asn Met Thr 
590 

ate ate tac 
He He Tyr 
605 

gag ctg gac 
Glu Leu Asp 



ttc aac ate 
Phe Asn He 



gtg ggc ggc 
Val Gly Gly 
655 

gtg aac cgc 
Val Asn Arg 
670 

ctg ccc gtg 



aag 1392 
Lys 



ggc 14 4 0 

Gly 

480 

gee 1488 
Ala 



ate 1536 
He 



cac 1584 
His 



gtg 1632 
Val 



tgg 1680 

Trp 

560 

gcc 1728 
Ala 



tgg 1776 
Trp 



age 1824 
Ser 



ctg 1872 
Leu 



acc 1920 

Thr 

640 

ctg 1968 
Leu 



gtg 2016 
Val 



ccc 2064 
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Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Val Pro 
675 680 685 

cgc ggc ccc gac cgc ccc gag ggc ate gag gag gag ggc ggc gag cgc 2112 
Arg Gly Pro Asp Arg Pro Glu Gly lie Glu Glu Glu Gly Gly Glu Arg 
690 695 700 

gac cgc gac cgc age acc cgc ctg gtg acc ggc ttc ctg ccc ctg ate 2160 
Asp Arg Asp Arg Ser Thr Arg Leu Val Thr Gly Phe Leu Pro Leu lie 
705 710 715 720 

tgg gac gac ctg cgc age ctg ttc ctg ttc age tac cat ega ttg cgc 2208 
*3?p Asp Asp Leu Arg Ser Leu Phe Leu— Phe Ser Tyr His Arg Leu Arg 
725 730 735 

gac ctg ctg ctg ate gtg gcc cgc ate gtg gag ctg ctg ggc egg cgc 2256 
Asp Leu Leu Leu lie Val Ala Arg lie Val Glu Leu Leu Gly Arg Arg 
740 745 750 

ggc tgg gag ate ctg aag tac tgg tgg aac ctg etc cag tac tgg age 2304 
Gly Trp Glu lie Leu Lys Tyr Trp Trp Asn Leu Leu Gin Tyr Trp Ser 
755 760 765 

cag gag ctg aag aac tet gca gtg age ctg ctg aac gcc acc gee ate 2352 
Gin Glu Leu Lys Asn Ser Ala Val Ser Leu Leu Asn Ala Thr Ala lie 
770 775 780 

gcc gtg gcc gag ggc acc gac cgc gtg ate gag gtg gtg cag cgc ate 2400 
Ala Val Ala Glu Gly Thr Asp Arg Val lie Glu Val Val Gin Arg lie 
785 790 795 800 

tgg cgc ggc ate ctg cac ate ccc ace ega att cgc cag ggc ttc gag 2448 
Trp Arg Gly lie Leu His He Pro Thr Arg He Arg Gin Gly Phe Glu 
805 810 815 

cgc gcc ctg ctg taa gga tee 2469 
Arg Ala Leu Leu * Gly Ser 
820 



<210> 72 
<211> 822 
<212> PRT 

<213> Artificial Sequence 



<400> 72 



Ala 


Ser 


Ala 


Ala 


Asp 


Arg 


Leu 


Trp 


Val 


Thr 


Val 


Tyr 


Tyr 


Gly 


Val 


Pro 


1 








5 










10 










15 




Val 


Trp 


Lys 


Asp 


Ala 


Thr 


Thr 


Thr 


Leu 


Phe 


Cys 


Ala 


Ser 


Asp 


Ala 


Lys 








20 










25 










30 




Ala 


Tyr 


Asp 


Thr 


Glu 


Val 


His 


Asn 


Val 


Trp 


Ala 


Thr 


His 


Ala 


Cys 


Val 






35 










40 










45 






Pro 


Thr 


Asp 


Pro 


Asn 


Pro 


Gin 


Glu 


Val 


Val 


Leu 


Gly Asn 


Val 


Thr 


Glu 




50 










55 










60 










Asn 


Phe 


Asn 


Met 


Gly 


Lys 


Asn 


Asn 


Met 


Val 


Glu 


Gin 


Met 


His 


Glu 


Asp 


65 










70 










75 










80 


He 


He 


Ser 


Leu 


Trp 
85 


Asp 


Gin 


Ser 


Leu 


Lys 
90 


Pro 


Cys 


Val 


Lys 


Leu 
95 


Thr 


Pro 


Leu 


Cys 


Val 


Thr 


Leu 


Asn 


Cys 


Thr 


Lys 


Leu 


Lys 


Asn 


Ser 


Thr 


Asp 
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Thr Asn Asn Thr 
115 

Asn lie Ser Thr 
130 

Phe Tyr Ser Leu 
145 

Arg Leu Arg Ser 

Val Ser Phe Glu 

180 

Ala lie Leu Lys 
195 

Thr Asn Val Ser 
210 

Ser Thr Gin Leu 
225 

lie Arg Ser Glu 

Leu Asn Glu Ser 
260 

Arg Lys Ser lie 
275 

Asp lie lie Gly 
290 

Asn Trp Thr Asn 
305 

Phe Asn Asn Thr 

Glu lie Val Met 
340 

Asn Thr Thr Gin 

355 

Gly Asn lie Thr 
370 

lie lie Asn Met 
385 

He Gly Gly Gin 

Thr Arg Asp Gly 
420 

Arg Pro Gly Gly 

435 

Lys Tyr Lys Val 
450 

Ala Lys Arg Arg 
465 

Ala Met Phe Leu 

Ala Ser Leu Thr 
500 

Val Gin Gin Gin 
515 

Leu Leu Gin Leu 

530 

Leu Ala Leu Glu 
545 

Gly Cys Ser Gly 



Arg Trp Gly Thr 
120 

Ser Val Arg Asn 
135 

Asp He Val Pro 
150 

Cys Asn Thr Ser 
165 

Pro He Pro He 

Cys Asn Asn Lys 
200 

Thr Val Gin Cys 
215 

Leu Leu Asn Gly 
230 

Asn Phe Thr Asn 
245 

Val Glu He Asn 

His He Gly Pro 
280 

Asp He Arg Gin 
295 

Thr Leu Lys Arg 
310 

Thr He Val Phe 

325 

His Ser Phe Asn 

Leu Phe Asn Ser 

360 

Ser Gly Thr He 
375 

Trp Gin Glu Val 
390 

He Lys Cys Leu 
405 

Gly Ser Asp Asn 

Gly Asp Met Arg 

440 

Val Lys He Glu 
455 

Val Val Gin Arg 
470 

Gly Phe Leu Gly 
485 

Leu Thr Val Gin 

Asn Asn Leu Leu 
520 

Thr Val Trp Gly 

535 

Arg Tyr Leu Gin 
550 

Lys Leu He Cys 
565 



105 

Gin Glu Met Lys 

Lys Met Lys Arg 
140 

He Asp Asn Asp 
155 

He He Thr Gin 
170 

His Phe Cys Ala 

185 

Thr Phe Asn Gly 

Thr His Gly He 
220 

Ser Leu Ala Glu 

235 

Asn Ala Lys Thr 
250 

Cys Thr Arg Pro 

265 

Gly Arg Ala Phe 

Ala His Cys Asn 
300 

Val Ala Glu Lys 
315 

Asn Gin Ser Ser 
330 

Cys Gly Gly Glu 
345 

Thr Trp Asn Glu 

Thr Leu Pro Cys 
380 

Gly Lys Ala Met 
395 

Ser Asn He Thr 
410 

Ser Ser Ser Gly 
425 

Asp Asn Trp Arg 

Pro Leu Gly He 
460 

Glu Lys Arg Ala 
475 

Ala Ala Gly Ser 
490 

Ala Arg Gin Leu 
505 

Arg Ala He Glu 

He Lys Gin Leu 
540 

Asp Gin Arg Phe 
555 

Thr Thr Ala Val 
570 



110 

Asn Cys Ser Phe 
125 

Glu Tyr Ala Leu 

Asn Thr Ser Tyr 
160 

Ala Cys Pro Lys 
175 

Pro Ala Gly Phe 

190 

Thr Gly Pro Cys 
205 

Arg Pro Val Val 

Glu Glu Val Val 
240 

He He Val Gin 
255 

Asn Asn Asn Thr 

270 

Tyr Thr Thr Gly 
285 

He Ser Arg Thr 

Leu Arg Glu Lys 
320 

Gly Gly Asp Pro 
335 

Phe Phe Tyr Cys 
350 

Thr Asn Ser Glu 

365 

Arg He Lys Gin 

Tyr Ala Pro Pro 
400 

Gly Leu Leu Leu 
415 

Lys Glu He Phe 
430 

Ser Glu Leu Tyr 

445 

Ala Pro Thr Lys 

Val Gly He Gly 
480 

Thr Met Gly Ala 
495 

Leu Ser Gly He 
510 

Ala Gin Gin His 

525 

Gin Ala Arg Val 

Leu Gly Met Trp 
560 

Pro Trp Asn Ala 
575 
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Ser Trp Ser Asn Lys Asn Leu Ser Gin lie Trp Asp Asn Met Thr Trp 

580 585 590 

Met Glu Trp Glu Arg Glu He Ser Asn Tyr Thr Glu He He Tyr Ser 

595 600 605 

Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Leu Asp Leu 

610 615 620 

Leu Gin Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn He Thr 
625 630 635 640 

Asn Trp Leu Trp Tyr He Lys He Phe He Met He Val Gly Gly Leu 

645 650 655 

He Gly Leu Arg He Val Phe Thr Val Leu Ser He Val Asn Arg Val 
660 665 670 

Gin Gly Tyr Ser Pro Leu Ser Phe-Cl^i Thr Arg Leu Pro Val Pro 
675 680 685 

Arg Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly Glu Arg 

690 695 700 

Asp Arg Asp Arg Ser Thr Arg Leu Val Thr Gly Phe Leu Pro Leu He 
705 710 715 720 

Trp Asp Asp Leu Arg Ser Leu Phe Leu Phe Ser Tyr His Arg Leu Arg 

725 730 735 

Asp Leu Leu Leu He Val Ala Arg He Val Glu Leu Leu Gly Arg Arg 

740 745 750 

Gly Trp Glu He Leu Lys Tyr Trp Trp Asn Leu Leu Gin Tyr Trp Ser 

755 760 765 

Gin Glu Leu Lys Asn Ser Ala Val Ser Leu Leu Asn Ala Thr Ala He 

770 775 780 

Ala Val Ala Glu Gly Thr Asp Arg Val He Glu Val Val Gin Arg He 
785 790 795 800 

Trp Arg Gly He Leu His He Pro Thr Arg He Arg Gin Gly Phe Glu 

805 810 815 

Arg Ala Leu Leu Gly Ser 
820 

<210> 73 

<211> 1431 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (1431) 

<400> 73 

get age gcg gcc gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 48 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 

15 10 15 

gtg tgg aag gac gcc acc acc acc ctg ttc tgc gcc age gac gcc aag 96 
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
20 25 30 

gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 144 
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 40 45 

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192 
Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 
50 55 60 
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aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 



240 



ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag etg aee 
lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 

85 90 95 



288 



ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac age ace gac 
Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 
100 105 110 



336 



aac aac acc cgc tgg ggc acc ca^-gag atg aag aac tgc age ttc 
Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 120 125 



384 



aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gee ctg 
Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 
130 135 140 



432 



ttc tac age etg gac ate gtg ccc ate gac aac gac aac acc age tae 
Phe Tyr Ser Leu Asp lie Val Pro lie Asp Asn Asp Asn Thr Ser Tyr 
145 150 155 160 

cgc etg cgc age tgc aac aca teg ate ate acc cag gee tgc ccc aag 
Arg Leu Arg Ser Cys Asn Thr Ser lie lie Thr Gin Ala Cys Pro Lys 
165 170 175 



480 



528 



gtg age ttc gag ccc ate ccc ate cac ttc tgc gee ccc gee ggc ttc 
Val Ser Phe Glu Pro lie Pro lie His Phe Cys Ala Pro Ala Gly Phe 
180 185 190 



576 



gee ate ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 
Ala lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 
195 200 205 



624 



ace aac gtg age ace gtg cag tgc acc cac gga att cgc ccc gtg gtg 
Thr Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro Val Val 
210 215 220 



672 



age ace cag ctg ctg ctg aac ggc age etg gee gag gag gag gtg gtg 
Ser Thr Gin Leu Leu Leu Asn Gly Ser Le.u Ala Glu Glu Glu Val Val 
225 230 235 240 



720 



ate aga tct gag aac ttc acc aac aac gcc aag acc ate ate gtg cag 
He Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr He He Val Gin 
245 250 255 



768 



ctg aac gag age gtg gag ate aac tgc aee cgc ccc aac aac aac acc 
Leu Asn Glu Ser Val Glu He Asn Cys Thr Arg Pro Asn Asn Asn Thr 
260 265 270 



816 



cgc aag age ate cac ate ggc cct ggc cgc gee ttc tae acc acc ggc 
Arg Lys Ser He His He Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 
275 280 285 



864 



gac ate 
Asp He 
290 



ate ggc gac ate cgc cag gee 
He Gly Asp He Arg Gin Ala 
295 



cac tgc aac ate tct aga acc 
His Cys Asn He Ser Arg Thr 
300 



912 
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aac tgg acc 
Asn Trp Thr 
305 

ttc aac aac 
Phe Asn Asn 



gag ate gtg 
Glu lie Val 



aac acc acc 
Asn Thr Thr 
355 

ggc aac ate 
Gly Asn lie 
370 

ate ate aac 
lie lie Asn 

385 

ate ggc ggc 
lie Gly Gly 



acc cgc gae 
Thr Arg Asp 



cgc cec ggc 
Arg Pro Gly 
435 

aag tac aag 
Lys Tyr Lys 
450 

gee aag cgc 
Ala Lys Arg 
465 



aac ace ctg 
Asn Thr Leu 
310 

acc acc ate 
Thr Thr lie 
325 

atg cac age 
Met His Ser 
340 

cag ctg ttc 
Gin Leu Phe 



act agt ggc 
Thr Ser Gly 



atg tgg cag 
Met Trp Gin 
390 

cag ate aag 
Gin lie Lys 
405 

ggc ggc age 
Gly Gly Ser 
420 

ggc ggc gae 
Gly Gly Asp 



gtg gtg aag 
Val Val Lys 



cgc gtg gtg 
Arg Val Val 
470 



aag cgc gtg 
Lys Arg Val 



gtg ttc aac 
Val Phe Asn 



ttc aac tgc 
Phe Asn Cys 
345 

aac age ace 
Asn Ser Thr 
360 

ace ate acc 
Thr lie Thr 
375 

gag gtg ggc 
Glu Val Gly 



tgc ctg age 
Cys Leu Ser 



gae aac teg 
Asp Asn Ser 
425 

atg cgc gae 
Met Arg Asp 
440 

ate gag cec 
lie Glu Pro 
455 

cag cgc gag 
Gin Arg Glu 



gee gag aag 
Ala Glu Lys 
315 

cag age tec 
Gin Ser Ser 
330 

ggc ggc gag 
Gly Gly Glu 



tgg aac gag 
Trp Asn Glu 



ctg cec tgc 
Leu Pro Cys 
380 

aag gee atg 
Lys Ala Met 
395 

aac ate acc 
Asn lie Thr 
410 

age age ggc 
Ser Ser Gly 



aac tgg cgc 
Asn Trp Arg 



ctg ggc ate 
Leu Gly lie 
4 60 

a*a.g cgc gee 
Lys Arg Ala 
475 



ctg cgc gag 
Leu Arg Glu 



ggc ggc gae 
Gly Gly Asp 
335 

ttc ttc tac 
Phe Phe Tyr 
350 

ace aac age 
Thr Asn Ser 
365 

cgc ate aag 
Arg lie Lys 



tac gee cec 
Tyr Ala Pro 



ggc ctg ctg 
Gly Leu Leu 
415 

aag gag att 
Lys Glu lie 
430 

age gag ctg 
Ser Glu Leu 
445 

gee cec acc 
Ala Pro Thr 



tag 



aag 960 

Lys 

320 

cec 1008 
Pro 



tgc 1056 
Cys 



gag 1104 
Glu 



cag 1152 
Gin 



cec 1200 

Pro 
400 

ctg 1248 
Leu 



ttc 1296 
Phe 



tac 1344 
Tyr 



aag 1392 
Lys 



1431 





<210> 


74 












<211> 


476 












<212> 


PRT 












<213> 


Artificial ; 


Sequence 




<400> 


74 










Ala 


Ser Ala 


Ala 


Asp 


Arg 


Leu 


Trp 


1 






5 








Val 


Trp Lys 


Asp 


Ala 


Thr 


Thr 


Thr 






20 










Ala 


Tyr Asp 


Thr 


Glu 


Val 


His 


Asn 




35 










40 


Pro 


Thr Asp 


Pro 


Asn 


Pro 


Gin 


Glu 



Val 


Thr 


Val 


Tyr Tyr 


Gly 


Val 


Pro 




10 








15 




Leu 


Phe 


Cys 


Ala Ser 


Asp 


Ala 


Lys 


25 








30 






Val 


Trp 


Ala 


Thr His 


Ala 


Cys 


Val 








45 








Val 


Val 


Leu 


Gly Asn 


Val 


Thr 


Glu 
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50 






Asn 


Phe 


Asn 


Met 


65 








He 


He 


Ser 


Leu 


Pro 


Leu 


Cys 


Val 








100 


Thr 


Asn 




Th r* 

X XiJU 






115 




Asn 


He 


Ser 


Thr 




130 






Phe 


Tyr 


Ser 


Leu 


4-4-5 








Arg 


Leu 


Arg 


Ser 


Val 


Ser 


Phe 


Glu 








180 


Ala 


JL JL 


Leu 


Lys 






195 




Thr 


Asn 


Val 


Ser 




210 






Ser 


Thr 


Gin 


Leu 


225 








He 


Arg 


Ser 


Glu 


Leu 


Asn 


Glu 


Ser 








260 


Arg 


Lys 


Q 

o er 


±±e 






275 




Asp 


He 


He 


Gly 




290 






Asn 


Trp 


Thr 


Asn 


305 








Phe 


Asn 


Asn 


Thr 


Glu 


He 


Val 


Met 








340 


Asn 


Thr 


X 111. 


oJ.Il 






355 




Gly 


Asn 


He 


Thr 




370 






He 


He 


Asn 


Met 


385 








He 


Gly 


Gly 


Gin 


Thr 


Arg 


Asp 


Gly 








420 


Arg 


Pro 


Gly 


Gly 






435 




Lys 


Tyr 


Lys 


Val 




450 






Ala 


Lys 


Arg 


Arg 


465 









55 

Gly Lys Asn Asn 
70 

Trp Asp Gin Ser 
85 

Thr Leu Asn Cys 

Arg Trp Gly Thr 
120 

Ser Val Arg Asn 

135 

Asp He Val Pro 
150 

Cys Asn Thr Ser 
165 

Pro He Pro He 

Cys Asn Asn Lys 
200 

Thr Val Gin Cys 
215 

Leu Leu Asn Gly 
230 

Asn Phe Thr Asn 
245 

Val Glu He Asn 

His He Gly Pro 
280 

Asp He Arg Gin 
295 

Thr Leu Lys Arg 

310 

Thr He Val Phe 
325 

His Ser Phe Asn 

Leu Phe Asn Ser 
360 

Ser Gly Thr He 
375 

Trp Gin Glu Val 

390 

He Lys Cys Leu 
405 

Gly Ser Asp Asn 

Gly Asp Met Arg 
440 

Val Lys He Glu 
455 

Val Val Gin Arg 
470 



Met 


Val 


Glu 


Gin 






75 




Leu 


Lys 


Pro 


Cys 




90 






Thr 


Lys 


Leu 


Lys 


105 








Gin 


Glu 


Met 


Lys 


Lys 


Met 


Lys 


Arg 








14 0 

±H \J 


He 


Asp 


Asn 


Asp 






155 




He 


He 


Thr 


Gin 




170 






His 


Phe 


Cys 


Ala 


185 








Thr 


Phe 


Asn 


Gly 


Thr 


His 


Gly 


He 








990 
z, z. w 


Ser 


Leu 


Ala 


Glu 






235 




Asn 


Ala 


Lys 


Thr 




250 






Cys 


Thr 


Arg 


Pro 


265 








Gly 


Arg 


Ala 


Phe 


Ala 


His 


Cys 


Asn 










Val 


Ala 


Glu 


Lys 






315 




Asn 


Gin 


Ser 


Ser 




330 






Cys 


Gly 


Gly 


Glu 


345 








Thr 


Trp 


Asn 


Glu 


Thr 


Leu 


Pro 


Cys 








380 


Gly 


Lys 


Ala 


Met 






395 




Ser 


Asn 


He 


Thr 




410 






Ser 


Ser 


Ser 


Gly 


425 








Asp 


Asn 


Trp 


Arg 


Pro 


Leu 


Gly 


He 








460 


Glu 


Lys 


Arg 


Ala 






475 





Met His Glu Asp 
80 

Val Lys Leu Thr 
95 

Asn Ser Thr Asp 
110 

Asn Cys Ser Phe 
125 

Glu Tyr Ala Leu 

Asn Thr Ser Tyr 
160 

Ala Cys Pro Lys 
175 

Pro Ala Gly Phe 
190 

Thr Gly Pro Cys 
205 

Arg Pro Val Val 

Glu Glu Val Val 
240 

He He Val Gin 
255 

Asn Asn Asn Thr 
270 

Tyr Thr Thr Gly 
285 

He Ser Arg Thr 

Leu Arg Glu Lys 

320 

Gly Gly Asp Pro 
335 

Phe Phe Tyr Cys 
350 

Thr Asn Ser Glu 
365 

Arg He Lys Gin 

Tyr Ala Pro Pro 

400 

Gly Leu Leu Leu 
415 

Lys Glu He Phe 
430 

Ser Glu Leu Tyr 
445 

Ala Pro Thr Lys 



<210> 75 
<211> 1038 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<221> CDS 

<222> (1038) 

<400> 75 

gcc gtg ggc ate ggc get atg ttc etc ggc ttc etg ggc get gea ggc 48 

Ala Val Gly lie Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly 
15 10 15 

age acc atg ggc gcc gcc age etg ace ctg acc gtg cag gcc cgc cag 96 
Ser Thr Met Gly Ala Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin 
20 25 30 

etg etg age ggc ate gtg cag cag cag aac aac ctg ctg cgc gcc ate 14 4 

Leu Leu Ser Gly lie Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie 
35 40 45 

gag gee cag cag eae ctg etc cag ctg acc gtg tgg ggc ate aag cag 192 
Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin 
50 55 60 

etc cag gee cgc gtg ctg get eta gag cgc tae etc cag gae cag cgc 240 
Leu Gin Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Gin Asp Gin Arg 
65 70 75 80 

ttc ctg ggc atg tgg ggc tgc tec ggc aag ctg ate tgc ace aeg gee 288 
Phe Leu Gly Met Trp Gly Cys Ser Gly Lys Leu lie Cys Thr Thr Ala 
85 90 95 

gtg ecc tgg aac gcc age tgg age aac aag aac ctg age cag att tgg 336 
Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Asn Leu Ser Gin lie Trp 
100 105 110 

gae aac atg ace tgg atg gag tgg gag cgc gag ate age aac tae ace 384 
Asp Asn Met Thr Trp Met Glu Trp Glu Arg Glu lie Ser Asn Tyr Thr 
115 120 125 

gag ate ate tae age etg ate gag gag age cag aac cag cag gag aag 432 
Glu lie lie Tyr Ser Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys 
130 135 140 

aac gag etg gae ctg etc cag ctg gae aag tgg gea age ttg tgg aac 480 
Asn Glu Leu Asp Leu Leu Gin Leu Asp Lys Trp Ala Ser Leu Trp Asn 
145 150 155 160 

tgg ttc aac ate acc aac tgg etg tgg tae ate aag att ttc ate atg 528 
Trp Phe Asn lie Thr Asn Trp Leu Trp Tyr lie Lys lie Phe lie Met 
165 170 175 

ate gtg ggc ggc ctg ate ggc ctg cgc ate gtg ttc acc gtg ctg age 576 
lie Val Gly Gly Leu lie Gly Leu Arg lie Val Phe Thr Val Leu Ser 
180 185 190 

ate gtg aac cgc gtg cgc cag ggc tae age ecc etg age ttc cag ace 624 
lie Val Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr 
195 200 205 



cgc ctg ecc gtg ecc cgc ggc ecc gae cgc ecc gag ggc ate gag gag 
Arg Leu Pro Val Pro Arg Gly Pro Asp Arg Pro Glu Gly lie Glu Glu 



672 
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210 



215 



220 



gag ggc ggc 
Glu Gly Gly 
225 

ttc ctg ccc 
Phe Leu Pro 



tac cat cga 
Tyr His Arg 



ctg ctg ggc 
Leu Leu Gly 

275 

etc cag tac 
Leu Gin Tyr 
290 

aac gcc ace 
Asn Ala Thr 
305 

gtg gtg cag 
Val Val Gin 



cgc cag ggc 
Arg Gin Gly 



gag cgc gac 
Glu Arg Asp 
230 

ctg ate tgg 
Leu lie Trp 
245 

ttg cgc gac 
Leu Arg Asp 
260 

egg cgc ggc 
Arg Arg Gly 



tgg age cag 
Trp Ser Gin 



gcc ate gcc 
Ala lie Ala 
310 

cgc ate tgg 
Arg lie Trp 
325 

ttc gag cgc 
Phe Glu Arg 
340 



cgc gac cgc 
Arg Asp Arg 



gac gac ctg 
Asp Asp Leu 



ctg ctg ctg 
Leu Leu Leu 
26B 

tgg gag ate 
Trp Glu lie 
280 

gag ctg aag 
Glu Leu Lys 
295 

gtg gcc gag 
Val Ala Glu 



cgc ggc ate 
Arg Gly lie 



gee ctg ctg 
Ala Leu Leu 
345 



age ace cgc 
Ser Thr Arg 
235 

cgc age ctg 
Arg Ser Leu 
250 

ate gtg gcc 
lie Val Ala 



ctg aag tac 
Leu Lys Tyr 



aac tet gea 
Asn Ser Ala 
300 

ggc ace gac 
Gly Thr Asp 
315 

ctg cae ate 
Leu His lie 
330 

taa 



ctg gtg ace 
Leu Val Thr 



ttc ctg ttc 
Phe Leu Phe 
255 

cgc ate gtg 
Arg lie Val 
270 

tgg tgg aac 
Trp Trp Asn 
285 

gtg age ctg 
Val Ser Leu 



cgc gtg ate 
Arg Val lie 



ccc ace cga 
Pro Thr Arg 
335 



ggc 720 

Gly 

240 

age 768 
Ser 



gag 816 
Glu 



ctg 864 
Leu 



ctg 912 
Leu 



gag 960 

Glu 

320 

att 1008 
He 



1038 



<210> 76 
<211> 345 

<212> PRT 

<213> Artificial Sequence 



<400> 76 



Ala 


Val 


Gly 


He 


Gly 


Ala 


Met 


Phe 


Leu 


Gly 


Phe 


Leu 


Gly 


Ala 


Ala 


Gly 


1 








5 










10 










15 




Ser 


Thr 


Met 


Gly 


Ala 


Ala 


Ser 


Leu 


Thr 


Leu 


Thr 


Val 


Gin 


Ala 


Arg 


Gin 








20 










25 










30 






Leu 


Leu 


Ser 


Gly 


He 


Val 


Gin 


Gin 


Gin 


Asn 


Asn 


Leu 


Leu 


Arg 


Ala 


He 






35 










40 










45 








Glu 


Ala 


Gin 


Gin 


His 


Leu 


Leu 


Gin 


Leu 


Thr 


Val 


Trp 


Gly 


He 


Lys 


Gin 




50 










55 










60 










Leu 


Gin 


Ala 


Arg 


Val 


Leu 


Ala 


Leu 


Glu 


Arg 


Tyr 


Leu 


Gin 


Asp 


Gin 


Arg 


65 










70 










75 










80 


Phe 


Leu 


Gly 


Met 


Trp 


Gly 


Cys 


Ser 


Gly 


Lys 


Leu 


He 


Cys 


Thr 


Thr 


Ala 










85 










90 










95 




Val 


Pro 


Trp 


Asn 


Ala 


Ser 


Trp 


Ser 


Asn 


Lys 


Asn 


Leu 


Ser 


Gin 


He 


Trp 








100 










105 










110 






Asp 


Asn 


Met 


Thr 


Trp 


Met 


Glu 


Trp 


Glu 


Arg 


Glu 


He 


Ser 


Asn 


Tyr 


Thr 






115 










120 










125 








Glu 


He 


He 


Tyr 


Ser 


Leu 


He 


Glu 


Glu 


Ser 


Gin 


Asn 


Gin 


Gin 


Glu 


Lys 




130 










135 










140 
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Asn 


Glu 


Leu 


Asp 


Leu 


Leu 


Gin 


Leu 


145 










150 






Trp 


Phe 


Asn 


lie 


Thr 


Asn 


Trp 


Leu 










165 








lie 


Val 


Gly 


Gly 


Leu 


He 


Gly 


Leu 








180 










lie 


Val 


Asn 


Arg 


Val 


Arg 


Gin 


Gly 






195 










200 


Arg 


Leu 


Pro 


Val 


Pro 


Arg 


Gly 


Pro 




210 










215 




Glu 


Gly 


Gly 


Glu 


Arg 


Asp 


Arg 


Asp 


225 










230 








Leu 


Pro 


Leu 


He 


Trp 


Asp 


Asp 










245 








Tyr 


His 


Arg 


Leu 


Arg 


Asp 


Leu 


Leu 








260 










Leu 


Leu 


Gly 


Arg 


Arg 


Gly 


Trp 


Glu 






275 










280 


Leu 


Gin 


Tyr 


Trp 


Ser 


Gin 


Glu 


Leu 




290 










295 




Asn 


Ala 


Thr 


Ala 


He 


Ala 


Val 


Ala 


305 










310 






Val 


Val 


Gin 


Arg 


He 


Trp 


Arg 


Gly 










325 








Arg 


Gin 


Gly 


Phe 


Glu 


Arg 


Ala 


Leu 



340 



57 



Asp 


Lys 


Trp 
155 


Ala 


Ser 


Leu 


Trp 


Asn 
160 


Trp 


Tyr 
170 


He 


Lys 


He 


Phe 


He 
175 


Met 


Arg 


He 


Val 


Phe 


Thr 


Val 


Leu 


Ser 


185 










190 






Tyr 


Ser 


Pro 


Leu 


Ser 
205 


Phe 


Gin 


Thr 


Asp 


Arg 


Pro 


Glu 
220 


Gly 


He 


Glu 


Glu 


Arg 


Ser 


Thr 
235 


Arg 


Leu 


Val 


Thr 


Gly 
240 


Leu— Arg 


Ser 


Leu 


Phe 


Leu 


Phe 


Ser 




250 










255 




Leu 


He 


Val 


Ala 


Arg 


He 


Val 


Glu 


265 










270 






He 


Leu 


Lys 


Tyr 


Trp 
285 


Trp 


Asn 


Leu 


Lys 


Asn 


Ser 


Ala 
300 


Val 


Ser 


Leu 


Leu 


Glu 


Gly 


Thr 
315 


Asp 


Arg 


Val 


He 


Glu 
320 


He 


Leu 
330 


His 


He 


Pro 


Thr 


Arg 
335 


He 


Leu 
















345 
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